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0.1 Preface 

This is a proto-preface. A more complete preface will be written after these notes are 
completed. 

These lecture notes are intended to supplement a graduate level course in condensed matter 
physics. 



Chapter 

Introductory Information 



Instructor : 
Contact : 
Lectures : 
Office Hours: 



Daniel Arovas 



Mayer Hall 5671 / 534-6323 / darovas@ucsd.edu 
Tu Th / 9:30 am - 10:50 am / Mayer Hall 5301 
W 2:00 pm - 3:30 pm / Mayer Hall 5671 



A strong emphasis of this class will be on learning how to calculate. I plan to cover the 
following topics this quarter: 

Transport: Boltzmann equation, transport coefficients, cyclotron resonance, magnetore- 
sistance, thermal transport, electron-phonon scattering 

Mesoscopic Physics: Landauer formula, conductance fluctuations, Aharonov-Bohm ef- 
fect, disorder, weak localization, Anderson localization 

Magnetism: Weak vs. strong, local vs. itinerant, Hubbard and Heisenberg models, spin 
wave theory, magnetic ordering, Kondo effect 

Other: Linear response theory, Fermi liquid theory (time permitting) 

There will be about four assignments and a take-home final examination. I will be following 
my own notes, which are available from the course web site. 
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CHAPTER 0. INTRODUCTORY INFORMATION 



0.1 References 

• D. Feng and G. Jin, Introduction to Condensed Matter Physics (I) 
(World Scientific, Singapore, 2005) 

New and with a distinctly modern flavor and set of topics. Looks good. 

• N, Ashcroft and N. D. Mermin, Solid State Physics 
(Saunders College Press, Philadelphia, 1976) 

Beautifully written, this classic text is still one of the best comprehensive guides. 

• M. Marder, Condensed Matter Physics 
(John Wiley & Sons, New York, 2000) 

A thorough and advanced level treatment of transport theory in gases, metals, semi- 
conductors, insulators, and superconductors. 

• D. Pines, Elementary Excitations in Solids 
(Perseus, New York, 1999) 

An advanced level text on the quantum theory of solids, treating phonons, electrons, 
plasmons, and photons. 

• P. L. Taylor and O. Heinonen, A Quantum Approach to Condensed Matter Physics 
(Cambridge University Press, New York, 2002) 

A modern, intermediate level treatment of the quantum theory of solids. 

• J. M. Ziman, Principles of the Theory of Solids 
(Cambridge University Press, New York, 1979). 

A classic text on solid state physics. Very readable. 
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• C. Kittel, Quantum Theory of Solids 
(John Wiley k Sons, New York, 1963) 

A graduate level text with several detailed derivations. 

• H. Smith and H. H. Jensen, Transport Phenomena 
(Oxford University Press, New York, 1989). 

A detailed and lucid account of transport theory in gases, liquids, and solids, both 
classical and quantum. 

• J. Imry, Introduction to Mesoscopic Physics 
(Oxford University Press, New York, 1997) 

• D. Ferry and S. M. Goodnick, Transport in Nanostructures 
(Cambdridge University Press, New York, 1999) 

• S. Datta, Electronic Transport in Mesoscopic Systems 
(Cambridge University Press, New York, 1997) 

• M. Janssen, Fluctuations and Localization 
(World Scientific, Singapore, 2001) 

• A. Auerbach, Interacting Electrons and Quantum Magnetism 
(Springer- Verlag, New York, 1994) 

• N. Spaldin, Magnetic Materials 
(Cambridge University Press, New York, 2003) 

• A. C. Hewson, The Kondo Problem to Heavy Fermions 
(Springer- Verlag, New York, 2001) 
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Chapter 1 



Boltzmann Transport 



1.1 References 



• H. Smith and H. H. Jensen, Transport Phenomena 



• N. W. Ashcroft and N. D. Mermin, Solid State Physics, chapter 13. 



• P. L. Taylor and O. Heinonen, Condensed Matter Physics, chapter 8. 



• J. M. Ziman, Principles of the Theory of Solids, chapter 7. 



1.2 Introduction 

Transport is the phenomenon of currents flowing in response to applied fields. By 'current' 
we generally mean an electrical current j, or thermal current j q . By 'applied field' we 
generally mean an electric field £ or a temperature gradient V T. The currents and fields 
are linearly related, and it will be our goal to calculate the coefficients (known as transport 
coefficients) of these linear relations. Implicit in our discussion is the assumption that we 
are always dealing with systems near equilibrium. 
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1.3 Boltzmann Equation in Solids 



1.3.1 Semiclassical Dynamics and Distribution Functions 

The semiclassical dynamics of a wavepacket in a solid are described by the equations 

± - v (k )- l d£ "W fin 
dt ~ Vn W-h dk 

f = >,t)-^(*)xfl(r,t). (1.2) 

Here, n is the band index and e n (k) is the dispersion relation for band n. The wavevector 
is k (hk is the 'crystal momentum'), and e n {k) is periodic under k — > A; + G, where G is 
any reciprocal lattice vector. These formulae are valid only at sufficiently weak fields. They 
neglect, for example, Zener tunneling processes in which an electron may change its band 
index as it traverses the Brillouin zone. We also neglect the spin-orbit interaction in our 
discussion. 

We are of course interested in more than just a single electron, hence to that end let us 
consider the distribution function f n (r,k,t), defined such that 1 

/ k t) ^cPk _ # of electrons of spin a in band n with positions within 
na ' ' (27r) 3 Sr of r and wavevectors within d 3 k of k at time t. 

Note that the distribution function is dimensionless. By performing integrals over the 
distribution function, we can obtain various physical quantities. For example, the current 
density at r is given by 

j(r,t) = -eJ2 j^fna(r,k,t)v n (k) . (1.4) 

The symbol f2 in the above formula is to remind us that the wavevector integral is performed 
only over the first Brillouin zone. 

We now ask how the distribution functions f n <r{ r i k, t) evolve in time. To simplify matters, 
we will consider a single band and drop the indices na. It is clear that in the absence of 
collisions, the distribution function must satisfy the continuity equation, 

g + V-(«/) = 0. (1.5) 

This is just the condition of number conservation for electrons. Take care to note that V 
and u are six-dimensional phase space vectors: 

u = ( x , y , z , k x , k y , k z ) (1.6) 

v = (A, A A A, AM . M 7 ) 

\dx ' dy dz dk x ' dk y ' dk z J 



1 We will assume three space dimensions. The discussion may be generalized to quasi- two dimensional 
and quasi-one dimensional systems as well. 
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Now note that as a consequence of the dynamics (1.1,1.2) that V • u = 0, i.e. phase space 
flow is incompressible, provided that e(k) is a function of k alone, and not of r. Thus, in 
the absence of collisions, we have 

§£ + «-V/ = 0. (1.8) 

The differential operator D t = dt + u ■ V is sometimes called the 'connective derivative'. 
EXERCISE: Show that V ■ u = 0. 

Next we must consider the effect of collisions, which are not accounted for by the semi- 
classical dynamics. In a collision process, an electron with wavevector k and one with 
wavevector k' can instantaneously convert into a pair with wavevectors k + q and k' — q 
(modulo a reciprocal lattice vector G), where q is the wavevector transfer. Note that the 
total wavevector is preserved (mod G). This means that D t f / 0. Rather, we should write 

where the right side is known as the collision integral. The collision integral is in general 
a function of r, k, and t and a functional of the distribution /. As the fc-dependence is 
the most important for our concerns, we will write Z k in order to make this dependence 
explicit. Some examples should help clarify the situation. 

First, let's consider a very simple model of the collision integral, 

f(r,k,t)-f(r,k) 
Ik{f} ~ ' ( } 

This model is known as the relaxation time approximation. Here, f°(r,k) is a static dis- 
tribution function which describes a local equilibrium at r. The quantity r(e(fc)) is the 
relaxation time, which we allow to be energy-dependent. Note that the collision integral in- 
deed depends on the variables (r,k,t), and has a particularly simple functional dependence 
on the distribution /. 

A more sophisticated model might invoke Fermi's golden rule, Consider elastic scattering 
from a static potential U(r) which induces transitions between different momentum states. 
We can then write 

Mf} = -jr ^2 \(k'\U\k)\ 2 (f k ,-f k )5(e k -e k ,) (1.11) 
k'en 

= W jj^y z \U{k-k')\\f k ,-f k )8{e k -e k ,) (1.12) 
n 



where we abbreviate f k = f(r,k,t). In deriving the last line we've used plane wave wave- 
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functions 2 ipk( r ) = exp(ifc • r)/y/V, as well as the result 

r 

T. A ^) = Vj J - v ,A{k) (1.13) 
ken „ 

for smooth functions A(fc). Note the factor of V~ l in front of the integral in eqn. 1.12. 
What this tells us is that for a bounded localized potential U(r), the contribution to the 
collision integral is inversely proportional to the size of the system. This makes sense 
because the number of electrons scales as V but the potential is only appreciable over a 
region of volume oc V°. Later on, we shall consider a finite density of scatterers, writing 

AT. 

U(r) = J2i=*\ P U{r — R { ), where the impurity density n imp = N- /V is finite, scaling as 
V°. In this case U(k — k') apparently scales as V, which would mean I k {f} scales as V, 
which is unphysical. As we shall see, the random positioning of the impurities means that 
the 0(V 2 ) contribution to \U(k — k')\ 2 is incoherent and averages out to zero. The coherent 
piece scales as V, canceling the V in the denominator of eqn. 1.12, resulting in a finite value 
for the collision integral in the thermodynamic limit (i.e. neither infinite nor infinitesimal). 

Later on we will discuss electron-phonon scattering, which is inelastic. An electron with 
wavevector kl can scatter into a state with wavevector k = kl — q mod G by absorption of 
a phonon of wavevector q or emission of a phonon of wavevector — q. Similarly, an electron 
of wavevector k can scatter into the state k' by emission of a phonon of wavevector — q or 
absorption of a phonon of wavevector q. The matrix element for these processes depends 
on k, k! , and the polarization index of the phonon. Overall, energy is conserved. These 
considerations lead us to the following collision integral: 

?k{f,n} = ^^2\g x (k,k')\ 2 {(l - f k ) fw (1 + n q ,x) 6(e k + hu q x - e k ,) 

k' ,A 

+(1 - fk) fk' n- q x 5(e k - huj^ q x - e k >) 

-fk (1 - fk') (1 + n- q x) 5(e k - huj-gx - e k >) 

-fk (1 - fk')n q x5(e k + hjjqx - £fc')| 5 q>k >- k mod G , (1.14) 

which is a functional of both the electron distribution f k as well as the phonon distribution 
n qX . The four terms inside the curly brackets correspond, respectively, to cases (a) through 
(d) in fig. 1.1. 

While collisions will violate crystal momentum conservation, they do not violate conserva- 
tion of particle number. Hence we should have 3 

/ d3r /(2^ Jfc{/} = °- (L15) 
n 

2 Rather than plane waves, we should use Bloch waves ip nk {f) = exp(ife • r) u nk (r), where cell function 
u nk (r) satisfies u nk (r + R) — u nk (v), where R is any direct lattice vector. Plane waves do not contain 
the cell functions, although they do exhibit Bloch periodicity ip nk {r + R) = exp(ifc ■ R) ip nk {v). 

3 If collisions are purely local, then f ^J)S ^fc{/} = at every point r in space. 

n 
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Figure 1.1: Electron-phonon vertices. 



The total particle number, 

N = jd\j-0^ f(r,k,t) (1.16) 
n 

is a collisional invariant - a quantity which is preserved in the collision process. Other 
collisional invariants include energy (when all sources are accounted for), spin (total spin), 
and crystal momentum (if there is no breaking of lattice translation symmetry) 4 . Consider 
a function F(r, k) of position and wavevector. Its average value is 

F(t) = J<PrJ-0pF{r,k)f{r,k,t). (1.17) 
n 

Taking the time derivative, 



4 Note that the relaxation time approximation violates all such conservation laws. Within the relaxation 
time approximation, there are no collisional invariants. 



dF dr dF dk 
dr dt dk dt 



f + Fl k {f} 



(1.18) 
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Hence, if F is preserved by the dynamics between collisions, then 

dF /■.,/■ d 3 k 

~dt 



/ d3r /(03 FTfc{/} ' (L19) 



which says that F(t) changes only as a result of collisions. If F is a collisional invariant, 
then F = 0. This is the case when F = 1, in which case F is the total number of particles, 
or when F = e(k), in which case F is the total energy. 

1.3.2 Local Equilibrium 

The equilibrium Fermi distribution, 

m = { exp (!ffi^) + 1 }- 1 (1 , 0) 

is a space-independent and time-independent solution to the Boltzmann equation. Since 
collisions act locally in space, they act on short time scales to establish a local equilibrium 
described by a distribution function 



(1.21) 



This is, however, not a solution to the full Boltzmann equation due to the 'streaming terms' 
r ■ d r + k ■ dk in the convective derivative. These, though, act on longer time scales than 
those responsible for the establishment of local equilibrium. To obtain a solution, we write 

f(r,k,t) = f°(r,k,t) + Sf(r,k,t) (1.22) 

and solve for the deviation Sf(r,k,t). We will assume ji = n(r) and T = T{r) are time- 
independent. We first compute the differential of /°, 

o 



KT df°\ dp (e-fi)dT | de 



de I KT k„T 2 k B T 



from which we read off 



^i^.dr + ^^.dr-^.dk] (123) 
de\dr dr+ T Or dr dk dk ' 
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We thereby obtain 



at 



+ v-VSf 



E 



1 



v x B 



dk 



+ v 



e£ 



VT 



de 



) =l k {f° + 5f} 



(1-26) 

where £ = — V(0 — /u/e) is the gradient of the 'electrochemical potential'; we'll henceforth 
refer to £ as the electric field. Eqn (1.26) is a nonlinear integrodifferential equation in 5f, 
with the nonlinearity coming from the collision integral. (In some cases, such as impurity 
scattering, the collision integral may be a linear functional.) We will solve a linearized 
version of this equation, assuming the system is always close to a state of local equilibrium. 

Note that the inhomogeneous term in (1.26) involves the electric field and the temperature 
gradient V T. This means that 5f is proportional to these quantities, and if they are small 
then 5f is small. The gradient of 5f is then of second order in smallness, since the external 
fields 4> — fi/e and T are assumed to be slowly varying in space. To lowest order in smallness, 
then, we obtain the following linearized Boltzmann equation: 



d6f_ _ e_ d6f_ 
dt he " dk 



e£ + £ —AvT 



de 



(1.27) 



where C Sf is the linearized collision integral; £ is a linear operator acting on 5f (we suppress 
denoting the k dependence of C). Note that we have not assumed that B is small. Indeed 
later on we will derive expressions for high B transport coefficients. 



1.4 Conductivity of Normal Metals 
1.4.1 Relaxation Time Approximation 

Consider a normal metal in the presence of an electric field £ . We'll assume B = 0, V T = 0, 
and also that £ is spatially uniform as well. This in turn guarantees that 5f itself is spatially 
uniform. The Boltzmann equation then reduces to 

d £- d £ev.£=l k {f + 6f}. (1.28) 
We'll solve this by adopting the relaxation time approximation for Ik{f}'- 

Ik{f} = = , (1-29) 

r r 

where r, which may be fc-dependent, is the relaxation time. In the absence of any fields 
or temperature and electrochemical potential gradients, the Boltzmann equation becomes 
5f = —Sf/r, with the solution 5f(t) = Sf(0) exp(— t/r). The distribution thereby relaxes to 
the equilibrium one on the scale of r. 
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Writing £(t) = £ e , we solve 



and obtain 

W)= 4«)|! s -. (1 . 31) 

1 - iooT{e(k)) de 

The equilibrium distribution f°(k) results in zero current, since f°(—k) = f°(k). Thus, 
the current density is given by the expression 

n 

In the above calculation, the factor of two arises from summing over spin polarizations. The 
conductivity tensor is defined by the linear relation j a (oj) = a a p(cj) £^(u). We have thus 
derived an expression for the conductivity tensor, 



%3M = 2e 2 | 



d 3 k T(e(k))v a (k)v l3 (k) ( df 



(2tt) 3 l-iuT(e(k)) V de 



(1.33) 



Note that the conductivity is a property of the Fermi surface. For k B T <C e F , we have 
—df°/de ~ S(e F — e(k)) and the above integral is over the Fermi surface alone. Explicitly, 
we change variables to energy e and coordinates along a constant energy surface, writing 

where dS £ is the differential area on the constant energy surface e(k) = e, and v(k) = 
h' 1 V fc e(k) is the velocity. For T < T F , then, 

a ad( iJ ) = , — 7 / dS*p — — . (1.35) 

For free electrons in a parabolic band, we write e(k) = h 2 k 2 /2m*, so v a (k) = hk a /m*. To 
further simplify matters, let us assume that r is constant, or at least very slowly varying in 
the vicinity of the Fermi surface. We find 

where g(e) is the density of states, 

/d 3 k 
— 5(e-e(k)) . (1.37) 
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The (three-dimensional) parabolic band density of states is found to be 

9{e) = ^ e(e) ' (i - 38) 

where @(x) is the step function. In fact, integrating (1.36) by parts, we only need to know 
about the \fe dependence in g(e), and not the details of its prefactor: 

Jdee g {e)(- d -^ = J de f{e)^{e g{e)) 

= I Jdeg(e)f°(e) = ln, (1.39) 

where n = N/V is the electron number density for the conduction band. The final result 
for the conductivity tensor is 



ne 2 T 5 a( 3 



mr l — vjJT 



(1.40) 



This is called the Drude model of electrical conduction in metals. The dissipative part of 
the conductivity is Re ex. Writing a a p = o5 a p and separating into real and imaginary parts 
a = a' + icr" , we have 

ne 2 T 1 
m* 1 + Ld 2 T 2 
The peak at uj = is known as the Drude peak. 



CT M = — i , ; o 2 • (1-41) 



Here's an elementary derivation of this result. Let p(i) be the momentum of an electron, 
and solve the equation of motion 

d P = _P_ e£e -^t (L42) 
dt t K J 



to obtain 



1 — IUJT 



/ x erf 
p(0) 



e-'/ r . (1.43) 



1 — ioJT _ 

The second term above is a transient solution to the homogeneous equation p + p/r = 0. 
At long times, then, the current j = —nep/m* is 

2 

= £ e ~ lwt . (1.44) 

In the Boltzmann equation approach, however, we understand that n is the conduction 
electron density, which does not include contributions from filled bands. 

In solids the effective mass m* typically varies over a small range: m* ~ (0.1 — 1) m e . The 
two factors which principally determine the conductivity are then the carrier density n and 
the scattering time r. The mobility fj,, defined as the ratio a(u = 0)/ne, is thus (roughly) 
independent of carrier density 5 . Since j = —nev = a£, where v is an average carrier 
velocity, we have v = —[J,£, and the mobility (i = er/m* measures the ratio of the carrier 
velocity to the applied electric field. 



5 Inasmuch as both r and m* can depend on the Fermi energy, fi is not completely independent of carrier 
density. 
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■Kfl(eV) 



Figure 1.2: Frequency-dependent conductivity of liquid sodium by T. Inagaki ei a/, Phys. 
Pev. B 13, 5610 (1976). 



1.4.2 Optical Reflectivity of Metals and Semiconductors 



What happens when an electromagnetic wave is incident on a metal? Inside the metal we 
have Maxwell's equations, 

_ 4-7T . 1 <9Z) , „ f 4ira iu\ „ , , , 

VxiJ=— ? + — — ifcx,B= LE (1.45 

c c ot \ c c / 

1 /-^ 

V x £ = — => ikxE = —B (1.46) 

c at c 

V-£ = V-.B = =► ik-E = ik-B = 0, (1.47) 

where we've assumed // = e = 1 inside the metal, ignoring polarization due to virtual 
interband transitions (i.e. from core electrons). Hence, 

fc2 = _ + __ (T(w) (1.48) 

= -2 + -i i — = £ M -2 ' L49 

<r <r 1 — iwr cr 

where iv p = \J Aime 2 / m* is the plasma frequency for the conduction band. The dielectric 
function, 



e(u) = 1 + — = 1 + 1 



UJ ij 1 1 — ILOT 



(1.50) 
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determines the complex refractive index, N(u>) = e(w), leading to the electromagnetic 
dispersion relation k = N(cj) uj/c. 

Consider a wave normally incident upon a metallic surface normal to z. In the vacuum 
(z < 0), we write 



B(r,t) 



icu 



E(r, t) = E 1 x e ^ z / c e - iujt + E 2 x e -Wc e -*rf 
V xE = E 1 y e Wc e -^ _ ^ ^ g-iwz/cg-iwt 



while in the metal (z > 0), 

B(r,t) 



E(r, t)=E 3 x e ^z/c e -iu,t 
V xE = NE 3 ye iNu)z/c e- iu}t 



(1.51) 
(1.52) 

(1.53) 
(1.54) 



Continuity of £xn gives E\ + E 2 = E 3 . Continuity oi H xh gives Ei — E 2 = N E 3 . Thus, 

£ 2 _ 1 - N £3 _ 2 

~ 1 + N ' ^1 ~ 1 + N 

and the reflection and transmission coefficients are 



(1.55) 



R(u>) 
T(u) 



E 2 



E3 



1 - N(w) 



1 + N(uj) 
4 

|l + N(u) 



(1.56) 
(1.57) 



We've now solved the electromagnetic boundary value problem. 



Typical values - For a metal with n = 10 cm and m* = m e , the plasma frequency is 
w p = 5.7 x 10 15 s _1 . The scattering time varies considerably as a function of temperature. In 
high purity copper atT = 4K, r«2x 1(T 9 s and lo p t « 10 7 . At T = 300 K, r » 2 x 10~ 14 s 
and lo p t w 100. In either case, WpT » 1. There are then three regimes to consider. 

• lot <C 1 <C cj p t: 

We may approximate 1 — iwr ss 1, hence 



iY 2 (w) 




1 - 



2^ 



(1.58) 



Hence it! ~ 1 and the metal reflects. 
• 1 <C lot <C cj p t: 



In this regime, 



iVV) « 1 



LOT. 



aw: 



o; 3 r 



(1.59) 
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Photon Energy (eV) 



Figure 1.3: Frequency-dependent absorption of hep cobalt by J. Weaver et at, Phys. Rev. 
B 19, 3850 (1979). 

which is almost purely real and negative. Hence N is almost purely imaginary and 
R « 1. (To lowest nontrivial order, R = 1 — 2/lj p t.) Still high reflectivity 

• 1 <C <jJ v t <C ujt: 



Here we have 

iv 2 H«i-4 => R= i± (1-60) 

to 2lo 
and R <C 1 - the metal is transparent at frequencies large compared to to p . 



1.4.3 Optical Conductivity of Semiconductors 

In our analysis of the electrodynamics of metals, we assumed that the dielectric constant 
due to all the filled bands was simply e = 1. This is not quite right. We should instead 
have written 

fc2 = e ^ + 4^M (161) 
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Photon Energy (eV) 



Figure 1.4: Frequency-dependent conductivity of hep cobalt by J. Weaver et al, Phys. 
Rev. B 19, 3850 (1979). This curve is derived from the data of fig. 1.3 using a Kramers- 
Kronig transformation. A Drude peak is observed at low frequencies. At higher frequencies, 
interband effects dominate. 



where eoo is the dielectric constant due to virtual transitions to fully occupied (i.e. core) 
and fully unoccupied bands, at a frequency small compared to the interband frequency. The 
plasma frequency is now defined as 

/ 47rne 2 

Wp= ~ ; — 



oo) = £oo, although again this 
It turns out that for insulators 

(1.64) 

where cj pv = y / 47rn v e 2 /m e , with n v the number density of valence electrons, and w g is 
the energy gap between valence and conduction bands. In semiconductors such as Si and 
Ge, ujg ~ 4eV, while co pv ~ 16 eV, hence eoo ~ 17, which is in rough agreement with the 
experimental values of ~ 12 for Si and ~ 16 for Ge. In metals, the band gaps generally are 
considerably larger. 

There are some important differences to consider in comparing semiconductors and metals: 



(1.63) 



where n is the conduction electron density. Note that e{LO — 
is only true for u smaller than the gap to neighboring bands, 
one can write 
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• The carrier density n typically is much smaller in semiconductors than in metals, 
ranging from n ~ 10 16 cm~ 3 in intrinsic (i.e. undoped, thermally excited at room 
temperature) materials to n ~ 10 19 cm' 3 in doped materials. 

• eoo ~ 10 — 20 and m*/m c « 0.1. The product e^m* thus differs only slightly from its 
free electron value. 



SinCe "semi £ 10 4 n metal> OIle haS 



^semi^ 10 - 2a; metal_ 10 -14 g _ 



(1.65) 



In high purity semiconductors the mobility ft = er/m* > 10 5 cm 2 /vs the low temperature 
scattering time is typically r ps 10~ n s. Thus, for oj^3 x 10 15 s _1 in the optical range, we 
have ijJT 3> ujpT S> 1, in which case N(u>) « y/e^ and the reflectivity is 



R 



1 



(1.66) 



Taking eoo = 
appear shiny. 



1 + v^o 

10, one obtains R = 0.27, which is high enough so that polished Si wafers 



1.4.4 Optical Conductivity and the Fermi Surface 

At high frequencies, when ujt 3> 1, our expression for the conductivity, eqn. (1.33), yields 

^-^h{-Vlfa\* h) \' (L67) 

where we have presumed sufficient crystalline symmetry to guarantee that a a p = a 5 a p is 
diagonal. In the isotropic case, and at temperatures low compared with T F , the integral 
over the Fermi surface gives 4-7r£:p v P = 127r 3 hn/m* , whence a = ine 2 /m*u>, which is the 
large frequency limit of our previous result. For a general Fermi surface, we can define 

a(w » r- 1 ) = (1.68) 



where the optical mass m opt is given by 

1 1 f ( df° 



m 12ir 3 hn 

'"-opt 



iW* J dS * I'WI ' <L69) 



Note that at high frequencies a(oj) is purely imaginary. What does this mean? If 

£(t) = £ cos(ujt) = \ S (e~ iwt + e +iujt ) (1.70) 
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then 

j(t) = \S {(j{uj) e~ iujt + a{-uj) e +iujt ) 

£ sm(ut) , (1.71) 



2 

ne 



m opt w 

where we have invoked a(—uj) = cj*{uj). The current is therefore 90° out of phase with the 
voltage, and the average over a cycle (j(t) ■ S(t)} = 0. Recall that we found metals to be 
transparent for uj 3> to p S> r _1 . 

At zero temperature, the optical mass is given by 

1 



m 12ir 3 hn 

'"-opt 



^JdS r \v(k)\. (1.72) 



The density of states, g(e F ), is 

from which one can define the thermodynamic effective mass m^ h , appealing to the low 
temperature form of the specific heat, 

c v = ^klTg{e F )^c» v , (1.74) 



where 



c° v = (3. 2 n) 1/3 (1-75) 



is the specific heat for a free electron gas of density n. Thus, 



47r(3 



^73 jdSMW 1 (1-76) 



1.5 Calculation of the Scattering Time 
1.5.1 Potential Scattering and Fermi's Golden Rule 

Let us go beyond the relaxation time approximation and calculate the scattering time r 
from first principles. We will concern ourselves with scattering of electrons from crystalline 
impurities. We begin with Fermi's Golden Rule 6 , 

= Y £ I < k ' I U I k > I 2 to' - /*) ^ £ ( fc ) - £ ( fc ')) > (1-77) 
fc' 



We'll treat the scattering of each spin species separately. We assume no spin-flip scattering takes place. 
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* 

tny 


Jm c 
expt 


* 

m th 

fir,, 

tny 


/m e 
expt 


T \ 
ijl 


1 A ^ 
1.40 


1.0 1 


1 (\A 


9 9^ 


Na 


1.00 


1.13 


1.00 


1.27 


K 


1.02 


1.16 


1.07 


1.26 


Rb 


1.08 


1.16 


1.18 


1.36 


Cs 


1.29 


1.19 


1.75 


1.79 


Cu 






1.46 


1.38 


Ag 






1.00 


1.00 


Au 






1.09 


1.08 



Table 1.1: Optical and thermodynamic effective masses of monovalent metals. (Taken from 
Smith and Jensen). 



where U(r) is a sum over individual impurity ion potentials, 



1 ' imp 

U{r)=Y J U{r-R j ) 

3=1 



\(k'\U\k)\=V- 2 \U(k-k')\ 2 - e i(fc - fc,) -^ 

3=1 

where V is the volume of the solid and 

17(g) = ( d\U{r)e- iqr 



(1.78) 
(1.79) 

(1.80) 



is the Fourier transform of the impurity potential. Note that we are assuming a single species 
of impurities; the method can be generalized to account for different impurity species. 

To make progress, we assume the impurity positions are random and uncorrelated, and we 
average over them. Using 



we obtain 



N- 

3=1 



A^imp ~\~ A; m p( A; m p 1) 8qfl , 



A" imp (A imp - 1) 



(1.81) 



\{k' \U\k)\ z = ^ \U(k - fc')| 2 + impV 7 2 nP ' <W • (1.82) 

EXERCISE: Verify eqn. (1.81). 

We will neglect the second term in eqn. 1.82 arising from the spatial average (q = 
Fourier component) of the potential. As we will see, in the end it will cancel out. Writing 
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f = f + 8f, we have 



X fc {/} = 



(2tt)' 



|£/(fc-fc')| 2 <5 



fr 2 fc 2 h 2 k' 2 



2m* 2m* 



(1.83) 



where ni mp = N- imp /V is the number density of impurities. Note that we are assuming a 
parabolic band. We next make the Ansatz 



8f k = T(e(k))e£-v(k) 



de 



e(fe) 



and solve for r(e(fc)). The (time- independent) Boltzmann equation is 

„ „.df 2vr o /"^A/ |A „ , ,. l2 J h 2 *? h 2 k' 2 ' 

-eS- v(k) -4— = -^n-eS ■ - — \U(k - k') 2 61 

v ' de h imp J (2vr) 3 1 v 71 ^ 2m* 2m* 

Of 



x T(e(k'))v{k 



de 



e(fc') 



-T{e{k))v{k) 



de 



e(k) / 



(1.84) 



(1.85) 



Due to the isotropy of the problem, we must have r(e(k)) is a function only of the magnitude 
of k. We then obtain 7 



hk n 



imp 



m* 4ir 2 h 



-(e(fc)) jdk' k' 2 j dk'\U{k-k 



h 2 k/m* m* 



(k - k') , 



(1.86) 



whence 



m* k F n 



imp 



r(e P ) 



4vr 2 /i 3 



y dk'\U(k F 



k - k F k')\ 2 (1 - k ■ k') 



(1.87) 



If the impurity potential U(r) itself is isotropic, then its Fourier transform U (q) is a function 
of q 2 = 4k 2 sin 2 ^ where cos •& = k k 1 and q = k' — k is the transfer wavevector. Recalling 
the Born approximation for differential scattering cross section, 

we may finally write 



a(0) 



(jI>L\ \U(k-k')\ 2 



(1.88) 



71 

= 2nn- v F I dd (1 — cost?) suit? 

r(e F ) 7 



(1.89) 



where v ¥ = hk F /m* is the Fermi velocity 8 . The mean /ree path is defined by £ = v f t. 



7 We assume that the Fermi surface is contained within the first Brillouin zone. 

s The subscript on <r F (i?) is to remind us that the cross section depends on fc F as well as $. 
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Notice the factor (1 — cosi?) in the integrand of (1.89). This tells us that forward scattering 
(i? = 0) doesn't contribute to the scattering rate, which justifies our neglect of the second 
term in eqn. (1.82). Why should r be utterly insensitive to forward scattering? Because 
r(e F ) is the transport lifetime, and forward scattering does not degrade the current. There- 
fore, a{d = 0) does not contribute to the 'transport scattering rate' r _1 (e F ). Oftentimes 
one sees reference in the literature to a 'single particle lifetime' as well, which is given by 
the same expression but without this factor: 



'sp 



'tr 



2irn imp v F J d0a P (0) | cqs ^ > sintf 
o 



(1.90) 



Note that r sp = (^ imp v F o" Pito t) 1 , where cr Fi tot is the total scattering cross section at energy 
e F , a formula familiar from elementary kinetic theory. 

To derive the single particle lifetime, one can examine the linearized time- dependent Boltz- 
mann equation with £ = 0, 

dSf k 



— 77 

dt imp 



v F jdk' a(ti kk ,) (Sf k > - Sf k ) , 



(1.91) 



where v = hk/m* is the velocity, and where the kernel is = cos 1 (fc • k'). We now 
expand in spherical harmonics, writing 



L,M 



where as before 



Expanding 



a tot = 2irjd'd sin#cr(??) 



6f k (t) = Y,A LM (t)Y LM (k) , 

L,M 

the linearized Boltzmann equation simplifies to 

dA, 

imp 



dt 



+ (1 - u L )n-v F a tot A 



LM 



whence one obtains a hierarchy of relaxation rates, 



(1.92) 

(1.93) 
(1.94) 

(1.95) 
(1.96) 



which depend only on the total angular momentum quantum number L. These rates de- 
scribe the relaxation of nonuniform distributions, when 5fk(t = 0) is proportional to some 
spherical harmonic Y LM (k). Note that r£^ = 0, which reflects the fact that the total 
particle number is a collisional invariant. The single particle lifetime is identified as 



r" 1 = t- 1 
'sp — ' L— >oo 



n imp' U P a tot > 



(1.97) 



corresponding to a point distortion of the uniform distribution. 
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1.5.2 Screening and the Transport Lifetime 

For a Coulomb impurity, with U(r) = —Ze 2 /r we have U(q) = —4-jrZe 2 /q 2 . Consequently, 

" w= fe5is)'- (L98) 

and there is a strong divergence as i? — ► 0, with cr F (i?) oc $~ 4 . The transport lifetime 
diverges logarithmically! What went wrong? 

What went wrong is that we have failed to account for screening. Free charges will rearrange 
themselves so as to screen an impurity potential. At long range, the effective (screened) 
potential decays exponentally, rather than as 1/r. The screened potential is of the Yukawa 
form, and its increase at low q is cut off on the scale of the inverse screening length A -1 . 
There are two types of screening to consider: 

• Thomas-Fermi Screening : This is the typical screening mechanism in metals. A weak 
local electrostatic potential (f>(r) will induce a change in the local electronic density 
according to 5n(r) = ecj)(r)g(e F ), where g(e F ) is the density of states at the Fermi 
level. This charge imbalance is again related to 4>(r) through the Poisson equation. 
The result is a self-consistent equation for cf>(r), 

V 2 = 4ne 5n 

= 4ire 2 g(e F ) <f> = A^f <\> ■ (1.99) 

The Thomas-Fermi screening length is A TF = (47re 2 <7(e p )) 

• Debye-Hiickel Screening : This mechanism is typical of ionic solutions, although it may 
also be of relevance in solids with ultra-low Fermi energies. From classical statistical 
mechanics, the local variation in electron number density induced by a potential 4>{r) 
is 

5n{r) = ne e ^ k B T - n « , (1.100) 

k B T 

where we assume the potential is weak on the scale of k B T/e. Poisson's equation now 
gives us 

\7 2 (j) = 4ire5n 
Amne 2 



k B T 



A D ^. (1.101) 



A screened test charge Ze at the origin obeys 

V 2 (j) = \~ 2 (f)- 4vrZe5(r) , (1.102) 

the solution of which is 

U(r) = -e4>(r) = - Z fe-^ =► U(q) = ■ t 1 " 103 ) 
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The differential scattering cross section is now 

a J#) =(— 1 \ (1.104) 

V 4£ f sm 2 \& + (2fc F A)-2 ) 

and the divergence at small angle is cut off. The transport lifetime for screened Coulomb 
scattering is therefore given by 

— ^— = 2TTn-v F (—) [ dd sintf (1 - costf) | — — 1 J 

r(e P ) imp \4eJ J \sm 2 \$ + (2fc F A)- 2 / 

= 2 -W F (f;) 2 {Ml + *C) " 7^} , d-105) 



with 



C = -A; 2 A 2 = % = k F a* B . (1.106) 



Here a* = eoo fr 2 /m*e 2 is the effective Bohr radius (restoring the €oo factor). The resistivity 
is therefore given by 



p= n ^ = Z 2 \ai n ^- F(k F a* B ), (1.107) 
ne z T e z n 



where 

M ln(1 + <) -r^c 



F(C) = 7 3<{ln(l + 7rC)- T ^ 7 l> • (1.108) 



With h/e 2 = 25,813 51 and a* « a B = 0.529 A, we have 



71 

p = 1.37 x 10~ 4 n ■ cm x Z 2 F(fc p a*) . (1.109) 



Impurity 


Ap per % 


Impurity 


A/9 per % 


Ion 


(//f2-cm) 


Ion 


(/ifi-cm) 


Be 


0.64 


Si 


3.2 


Mg 


0.60 


Ge 


3.7 


B 


1.4 


Sn 


2.8 


Al 


1.2 


As 


6.5 


In 


1.2 


Sb 


5.4 



Table 1.2: Residual resistivity of copper per percent impurity. 
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0.5 1 1.5 2 

<= k F a B 

Figure 1.5: Residual resistivity per percent impurity. 

1.6 Boltzmann Equation for Holes 
1.6.1 Properties of Holes 

Since filled bands carry no current, we have that the current density from band n is 

^3 fn(r, k, t) v n {k) = +2e J ^ f n (r, k, t) v n {k) , (1.110) 

n Ct 

where / = 1 — /. Thus, we can regard the current to be carried by fictitious particles of 
charge +e with a distribution f(r, k,t). These fictitious particles are called holes. 

1. Under the influence of an applied electromagnetic field, the unoccupied levels of a 
band evolve as if they were occupied by real electrons of charge — e. That is, whether 
or not a state is occupied is irrelevant to the time evolution of that state, which is 
described by the semiclassical dynamics of eqs. (1.1, 1.2). 

2. The current density due to a hole of wavevector k is +ev n (k)/V. 

3. The crystal momentum of a hole of wavevector k is P = —hk. 

4. Any band can be described in terms of electrons or in terms of holes, but not both 
simultaneously. A "mixed" description is redundant at best, wrong at worst, and 
confusing always. However, it is often convenient to treat some bands within the 
electron picture and others within the hole picture. 
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l*A> l*B> 

Figure 1.6: Two states: | ) = e k h\. \ ) and | *b ) = e' k h_ k \ ). Which state carries 
more current? What is the crystal momentum of each state? 

It is instructive to consider the exercise of fig. 1.6. The two states to be analyzed are 

l*A> = < fc ^,*l*o> = i^|o> (i-iii) 

|*B> = V' c t , fe V' v ,- fe |*o> = 4^|0> , (1-112) 

where e k = ip]. k is the creation operator for electrons in the conduction band, and h k = ip vk 
is the creation operator for holes (and hence the destruction operator for electrons) in the 
valence band. The state | ^ ) has all states below the top of the valence band filled, and 
all states above the bottom of the conduction band empty. The state | ) is the same state, 
but represented now as a vacuum for conduction electrons and valence holes. The current 
density in each state is given by j = e(v h — v e ) /V, where V is the volume (i. e. length) of the 
system. The dispersions resemble e c ,v ~ ±5-Eg ± h 2 k 2 /2m*, where E g is the energy gap. 

• State | * A }: 

The electron velocity is v c = hk/m*; the hole velocity is v h = —hk/m*. Hence, 
the total current density is j w —2ehk/m*V and the total crystal momentum is 
P = Pc + p h = hk — hk = 0. 

• State | * B ): 

The electron velocity is v c = hk/m*; the hole velocity is v h = —h(—k)/m*. The 
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total current density is j ~ 0, and the total crystal momentum is P = p c + p h = 
kk - h(-k) = 2hk. 

Consider next the dynamics of electrons near the bottom of the conduction band and holes 
near the top of the valence band. (We'll assume a 'direct gap', i.e. the conduction band 
minimum is located directly above the valence band maximum, which we take to be at the 
Brillouin zone center k = 0, otherwise known as the T point.) Expanding the dispersions 
about their extrema, 



Itfmlf 1 ^ kP 



The velocity is 



e v (k) = e 
e c {k) = el + \h 2 ml fi - l k a k" . 

1 de 



v a {k) 



= ±hm-lkP , 



(1.113) 
(1.114) 

(1.115) 



hdk a a(S ' 

where the + sign is used in conjunction with m c and the — sign with m v . We compute the 
acceleration a = r via the chain rule, 



~dkP ' ~dT 



EP+ 1 (vx Bf 

c 

E$ + - (v x Bf 



(1.116) 
(1.117) 



Thus, the hole wavepacket accelerates as if it has charge +e but a positive effective mass. 

Finally, what form does the Boltzmann equation take for holes? Starting with the Boltz- 
mann equation for electrons, 



dt +r dr +k dk~ fcl/l ' 

we recast this in terms of the hole distribution / = ! — /, and obtain 



dt dr dk 



-2fc{l-/} 



(1.118) 



(1.119) 



This then is the Boltzmann equation for the hole distribution /. Recall that we can expand 
the collision integral functional as 



I k {f + Sf} = CSf+... 



(1.120) 



where £ is a linear operator, and the higher order terms are formally of order (5f) 2 . Note 
that the zeroth order term Xfe{/ } vanishes due to the fact that f° represents a local 
equilibrium. Thus, writing / = f° + Sf 



-I k {l - /} = -I k {l - f - Sf} = C5f + .. 



(1.121) 
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and the linearized collisionless Boltzmann equation for holes is 



d6 [_± vxB d6 l_ v 
dt he dk 



eS + '-^VT 



dp 

de 



(1.122) 



which is of precisely the same form as the electron case in eqn. (1.27). Note that the local 
equilibrium distribution for holes is given by 



f°(r,k,t) 



u(r,t)-e(k) 
' ' k B T(r,t) 1 ! 



(1.123) 



1.7 Magnetoresistance and Hall Effect 
1.7.1 Boltzmann Theory for p a/3 (uj,B) 

In the presence of an external magnetic field B, the linearized Boltzmann equation takes 
the form 9 

We will obtain an explicit solution within the relaxation time approximation C5f = —Sf/r 
and the effective mass approximation, 

e(k) = ±±h 2 m- 1 pk a k> 3 => v a = ±hm- 1 pk f3 , (1.125) 

where the top sign applies for electrons and the bottom sign for holes. With £{t) = £ e~ wt , 
we try a solution of the form 

8f(k, t) = k- A(e) e~ iujt = 6f(k) e~ iujt (1.126) 

where A{e) is a vector function of e to be determined. Each component A a is a function of 
k through its dependence on e = e(k). We now have 

(r- 1 - iu) k»A» - - e api v* B? — (k»A») = e v • £ JL , (i.i 27 ) 

where e„ fl „ is the Levi-Civita tensor. Note that 

a/57 



= e a ^v a B?A\ (1.128) 



3 For holes, we replace f° — > /° and 5/ — * 5/. 
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owing to the asymmetry of the Levi-Civita tensor: e a p y v a v"' = 0. Now invoke hk a = 

±m a ^v 13 , and match the coefficients of v a in each term of the Boltzmann equation. This 
yields, 

(r- 1 - iu) m ap ± - e api W\ A? = ±he -±- E* . (1.129) 



Defining 



T a ^(r- 1 -iuj)m ap ±-e aPl B^ , (1.130) 

we obtain the solution 

5f = ev"m a pT-i£~<^-. (1.131) 

From this, we can compute the current density and the conductivity tensor. The electrical 
current density is 

= +2e2£ i^ vavUm ^ r ^ ' (i - i32) 

where we allow for an energy-dependent relaxation time r(e). Note that T a p(e) is energy- 
dependent due to its dependence on r. The conductivity is then 

o ap (u,B) = 2tfe*m-J j H> V (-^) T^(e)\ (1.133) 

n 

oo 

= ± 2 -ezJdeeg(e)r-J(e)(- d £^ , (1.134) 

— oo 

where the chemical potential is measured with respect to the band edge. Thus, 

a a ^,B) = ne 2 (T- 1 p ) , (1.135) 
where averages denoted by angular brackets are defined by 



00 / n \ 

1 dee 9 (e) (-%-) T${e) 
Jdeeg(e) (-^) 

-oo 

The quantity n is the carrier density, 

OO f 
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EXERCISE: Verify eqn. (1.134). 



For the sake of simplicity, let us assume an energy-independent scattering time, or that the 
temperature is sufficiently low that only r(e F ) matters, and we denote this scattering time 
simply by r. Putting this all together, then, we obtain 



= —r a = — 



(r 1 - iuj)m af3 ± - e aM B^ 



(1.138) 
(1.139) 



We will assume that B is directed along one of the principal axes of the effective mass 
tensor m a/3 , which we define to be x, y, and z, in which case 



P«B^i B ) 



' '{t- 1 - iu) m* x ±eB/c 
TeB/c {T~ l -ioj)m* 

(t- 1 - iu) m* z/ 



(1.140) 



where rn xyz are the eigenvalues of m Q/3 and B lies along the eigenvector z. 
Note that 



ne 2 T 



(1 — iujt) 



is independent of B. Hence, the magnetoresistance, 

A Pxx( B ) = Pxx(B) - p xx (0) 



(1.141) 



(1.142) 



vanishes: Ap xx (B) = 0. While this is true for a single parabolic band, deviations from 
parabolicity and contributions from other bands can lead to a nonzero magnetoresistance. 



The conductivity tensor a af3 is the matrix inverse of p a/3 . Using the familiar equality 



a b\ 1 / d -b 

c d) ad - be \-c a J ' 



(1.143) 



we obtain 



( (l-iuT)/m* UcT/^/mZm* 

2j_/,., r\2 



a a 0( UJ ^B) = ne 2 T 



± 



(l-iu,T) 2 + Kr) 2 



where 



(l-^r) 2 + Kr) 2 

(1-wt) / 'ml 
(l-jo;r) 2 H-(a; c r) 2 (l-^r) 2 +Kr) 2 



eB 
m i c 



I ) 



(l—iuJT)m, 



(1.144) 



(1.145) 
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with m* ± = y/m* m*, is the cyclotron frequency. Thus, 



ne 2 r 1 — ilot 



a zz ^,B) = ^- . (1.147) 

Note that o xxm are field-dependent, unlike the corresponding components of the resistivity 
tensor. 



1.7.2 Cyclotron Resonance in Semiconductors 

A typical value for the effective mass in semiconductors is m*~0.1m e . From 

— = 1.75 x 10 7 Hz/G , (1.148) 

m e c 

we find that eB/m*c = 1.75 x 10 11 Hz in a field of B = lkG. In metals, the disorder is 
such that even at low temperatures lo c t typically is small. In semiconductors, however, the 
smallness of m* and the relatively high purity (sometimes spectacularly so) mean that lo c t 
can get as large as 10 3 at modest fields. This allows for a measurement of the effective mass 
tensor using the technique of cyclotron resonance. 

The absorption of electromagnetic radiation is proportional to the dissipative (i.e. real) part 
of the diagonal elements of a af3 (uj), which is given by 

/ , „n ne 2 T 1 + (A 2 + l)s 2 , A . 

^ B) = ^K 1 + 2(A 2 + 1). 2 + (A 2 -1)V ' (L149) 

where A = B/B^, with B w = m* ± cuj/e, and s = lot. For fixed to, the conductivity a' xx (B) 
is then peaked at B = B* . When wt » 1 and lo c t 3> 1, B* approaches B w , where 
a' xx (oj , Bu) = ne 2 T/2m*. By measuring B^ one can extract the quantity m* ± = eB^/uoc. 
Varying the direction of the magnetic field, the entire effective mass tensor may be deter- 
mined. 

For finite lot, we can differentiate the above expression to obtain the location of the cyclotron 
resonance peak. One finds B = (1 + a) 1 / 2 B w , with 

g= -(2^ + 11 + ^ + 1)^-1 

s 2 

As depicted in fig. 1.7, the resonance peak shifts to the left of B^ for finite values of lot. 
The peak collapses to B = when lot < l/v3 = 0.577. 
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Figure 1.7: Theoretical cyclotron resonance peaks as a function of B/B w for different values 



1.7.3 Magnetoresistance: Two-Band Model 

For a semiconductor with both electrons and holes present - a situation not uncommon to 
metals either {e.g. Aluminum) - each band contributes to the conductivity. The individual 
band conductivities are additive because the electron and hole conduction processes occur 
in parallel, exactly as we would deduce from eqn. (1.4). Thus, 



where a a l is the conductivity tensor for band n, which may be computed in either the 
electron or hole picture (whichever is more convenient). We assume here that the two 
distributions Sf c and 5f v evolve according to independent linearized Boltzmann equations, 
i.e. there is no interband scattering to account for. 

The resistivity tensor of each band, p^ exhibits no magnetoresistance, as we have found. 
However, if two bands are present, the total resistivity tensor p is obtained from p^ 1 = 
Pc 1 + Pv 1 , and 



Of LOT. 




(1.151) 



n 




(1.152) 



will in general exhibit the phenomenon of magnetoresistance. 



Explicitly, then, let us consider a model with isotropic and nondegenerate conduction band 
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minimum and valence band maximum. Taking B = Bz, we have 

Pc ()\ 



where 



(1 - iuJT c )m c B 



n c e 2 T c 



1 + 



(1 — iuJT v )m v B 

Pv 



n v e 2 r v 



n v ec 



a n = 



(1 — iuT c )m. 
n c e 2 T c 






1 


°\ 




a c 


-1 








-( 


~Pc 








0/ 




K 





1 


ON 




^a v 










)- 


Pv 












v° 


)m c 






P c 













a c J 



a 



n c ec 



(1.153) 



(1.154) 



(1.155) 



a,. 



(1 — iuJT v )m v 
n v e 2 T v 

we obtain for the upper left 2x2 block of p: 



a. 



+ 



a c 



a% + P 2 a 2 c + (3 2 



+ 



Pv 



Pv 



B 



n v ec 



Pc 



Q 2 + /?2 a 2 + £2 



+ 



Pc 



i Pv /3c ay _i_ a c 

\ C.2+/32 a 2+/3 c 2 a 2+/32 "I" ai+pj 



from which we compute the magnetoresistance 



Pxx(B) - p xx (0) a - a - 


^ Q~c gv N 

Vn c ec n v ec J 


2 


^(°) (cr c + a v ) 2 + ( 


<J c a v ) 2 (^- 

\n c ec 


n v ec/ 



where 



n c e 2 r c 1 
m c 1 — iwTb 



(1.156) 



(1.157) 



(1.158) 



(1.159) 



2 

- ! n v& T v i . . 

cj v = a v = • . (1.160) 

m v 1 — iujT v 

Note that the magnetoresistance is positive within the two band model, and that it saturates 
in the high field limit: 

p xx (B oo) - p M (0) _ Gc av " ^) 



^( ) (CT C C7 V ) 2 (^ + ^) 

\n c ec n v ec J 



2 • 



(1.161) 
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The longitudinal resistivity is found to be 

p zz = (°c + v v r 1 (i-i62) 

and is independent of B. 

In an intrinsic semiconductor, n c = n v oc exp(— E g /2k B T), and Ap xx (B) / p xx (0) is finite 
even as T — ► 0. In the extrinsic (i.e. doped) case, one of the densities (say, n c in a p-type 
material) vanishes much more rapidly than the other, and the magnetoresistance vanishes 
with the ratio n c /n v . 



1.7.4 Hall Effect in High Fields 

In the high field limit, one may neglect the collision integral entirely, and write (at co = 0) 

-ev.S d f-^vxB. d f = 0. (1.163) 
oe nc dk 

We'll consider the case of electrons, and take S = £y and B = Bz, in which case the 
solution is 

Note that fc^ is not a smooth single-valued function over the Brillouin-zone due to Bloch 
periodicity. This treatment, then, will make sense only if the derivative df°/de confines k 
to a closed orbit within the first Brillouin zone. In this case, we have 

i ~2ec £ - f^k (1165) 
ti 

n £ f d 3 k , df° 
= 2ec— 1 



te^- (L166) 



B J (2tt) 

Now we may integrate by parts, if we assume that f° vanishes on the boundary of the 
Brillouin zone. We obtain 

2ec£ f d 3 k nec 

J * = -^Jj2^ f =~^ £ - (L167) 
n 

We conclude that 

ftec 

°xy = -Vyx = -q , (1.168) 

independent of the details of the band structure. "Open orbits" - trajectories along Fermi 
surfaces which cross Brillouin zone boundaries and return in another zone - post a subtler 
problem, and generally lead to a finite, non- saturating magnetoresistance. 

For holes, we have f° = 1 — /° and 



2ec£ f d 3 k , df° nec „ . . 
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Figure 1.8: Nobel Prize winning magnetotransport data in a clean two-dimensional electron 
gas at a GaAs-AlGaAs inversion layer, from D. C. Tsui, H. L. Stormer, and A. C. Gossard, 
Phys. Rev. Lett. 48, 1559 (1982). p xy and p xx are shown versus magnetic field for a set of 
four temperatures. The Landau level filling factor is v = nhc/eB. At T = 4.2 K, the Hall 
resistivity obeys p xy = B/nec (n = 1.3 x 10 n cm~ 2 ). At lower temperatures, quantized 
plateaus appear in p xy (B) in units of h/e 2 . 

and a xy = +nec/B, where n is the hole density. 

We define the Hall coefficient Rn = —p xy /B and the Hall number 

Z U EE ^— , (1.170) 

n ion ecR u 

where n ion is the ion density. For high fields, the off-diagonal elements of both p af3 and 

a a/3 are negligible, and p xy = —a xy . Hence i?H = =Fl/nec, and z n = ±n/n ion . The high 
field Hall coefficient is used to determine both the carrier density as well as the sign of the 
charge carriers; z n is a measure of valency. 

In Al, the high field Hall coefficient saturates at z H = —1. Why is z H negative? As it turns 
out, aluminum has both electron and hole bands. Its valence is 3; two electrons go into a 
filled band, leaving one valence electron to split between the electron and hole bands. Thus 
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Figure 1.9: Energy bands in aluminum. 

n = 3n ion The Hall conductivity is 

a xy = 0\ ~ n c) ec / B ■ (1-171) 

The difference re h — n e is determined by the following argument. The electron density in 
the hole band is n' e = 2n ion — n h , i.e. the total density of levels in the band (two states per 
unit cell) minus the number of empty levels in which there are holes. Thus, 

n h - n e = 2n ion ~ K + O = «ion > (1.172) 

where we've invoked n e + n' e = ni on , since precisely one electron from each ion is shared 
between the two partially filled bands. Thus, a xy = n- lon ec/B = nec/2>B and z H = —1. At 
lower fields, z H = +3 is observed, which is what one would expect from the free electron 
model. Interband scattering, which is suppressed at high fields, leads to this result. 



1.8 Thermal Transport 
1.8.1 Boltzmann Theory 

Consider a small region of solid with a fixed volume A^. The first law of thermodynamics 
applied to this region gives TAS = AE — [iAN. Dividing by AV gives 

dq = T ds = de — fidn , (1.173) 

where s is the entropy density, e is energy density, and n the number density. This can be 
directly recast as the following relation among current densities: 

3 q = Tj s =j £ - V3n , (1-174) 
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Figure 1.10: Fermi surfaces for electron (pink) and hole (gold) bands in Aluminum. 



where j n = e) is the number current density, j £ is the energy current density, 

• 9 f Sk Xt 

h 



(1.175) 



and j s is the entropy current density. Accordingly, the thermal (heat) current density j q is 
defined as 



j q = Tj s =j £ + ^j 

2 l£^{e-n)v8f . 



27T 



(1.176) 
(1.177) 



n 



In the presence of a time-independent temperature gradient and electric field, linearized 
Boltzmann equation in the relaxation time approximation has the solution 



5f = -T(e)v.(eE + £ - Y »VT) (-^ 



(1.178) 



We now consider both the electrical current j as well as the thermal current density j q . 
One readily obtains 



-2e 



d 3 k 



v8f = L U £-L 12 VT 



/d 3 k 
(e-^v5f^L 21 S-L 22 VT 



(1.179) 
(1.180) 
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where the transport coefficients L 11 etc. are matrices: 

□2 r / a f 0\ r .,a,,P 



^^H'A- B ^)h^ (L181) 

Lt = TLt = fdsr(s) (e - „) (-^) /dS e ^ (1.182) 

L - = i^4r - "> 2 ("f) 9f • <L183) 



If we define the hierarchy of integral expressions 



^ s *k H e) (£ - " ) " h 9r (Ll84) 

then we may write 

Lt = e 2 jf Lt = TLt = -e jf V* = \ jf . (1.185) 

The linear relations in eqn. (1.180) may be recast in the following form: 

S = pj + QVT (1.186) 
j q = Hj- K VT, (1.187) 

where the matrices p, Q, n, and k are given by 

p = L u 1 Q = L u 1 L l2 (1.188) 

n = L 21 L 11 1 k = L 22 — L 21 L 12 , (1.189) 

or, in terms of the Jn, 

n = - 1 -J 1 J 1 k = ^(j 2 -J 1 j 1 j 1 ), (1.191) 

The names and physical interpretation of these four transport coefficients is as follows: 

• p is the resistivity: £ = pj under the condition of zero thermal gradient (i.e. VT = 0). 

• Q is the thermopower: E = QVT under the condition of zero electrical current {i.e. 
j = 0). Q is also called the Seebeck coefficient. 

• n is the Peltier coefficient: j q = nj when VT = 0. 

• k is the thermal conductivity: j q = -kVT when j = . 
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T = T 



l 




T=T 



T = T 
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v = (q b -q a )(i\-t ) 



Figure 1.11: A thermocouple is a junction formed of two dissimilar metals. With no electri- 
cal current passing, an electric field is generated in the presence of a temperature gradient, 
resulting in a voltage V = Va — Vb- 

One practical way to measure the thermopower is to form a junction between two dissimilar 
metals, A and B. The junction is held at temperature T± and the other ends of the metals 
are held at temperature To. One then measures a voltage difference between the free ends 
of the metals - this is known as the Seebeck effect. Integrating the electric field from the 
free end of A to the free end of B gives 



What one measures here is really the difference in thermopowers of the two metals. For an 
absolute measurement of Q\, replace B by a superconductor (Q = for a superconductor). 
A device which converts a temperature gradient into an emf is known as a thermocouple. 

The Peltier effect has practical applications in refrigeration technology. Suppose an electrical 
current I is passed through a junction between two dissimilar metals, A and B. Due to 
the difference in Peltier coefficients, there will be a net heat current into the junction of 
W = (FIa — I Ib) I- Note that this is proportional to /, rather than the familiar I 2 result 
from Joule heating. The sign of W depends on the direction of the current. If a second 
junction is added, to make an ABA configuration, then heat absorbed at the first junction 
will be liberated at the second. 10 

10 To create a refrigerator, stick the cold junction inside a thermally insulated box and the hot junction 
outside the box. 



B 




(1.192) 



A 
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B 






\ 





\ = (n B -n A )-i 

Figure 1.12: A sketch of a Peltier effect refrigerator. An electrical current / is passed through 
a junction between two dissimilar metals. If the dotted line represents the boundary of a 
thermally well-insulated body, then the body cools when rie > I~Ia, in order to maintain a 
heat current balance at the junction. 



1.8.2 The Heat Equation 



We begin with the continuity equations for charge density p and energy density e: 

^ + V-j = (1.193) 

de 

- + V-j £ =j-E, (1.194) 

where E is the electric field 11 . Now we invoke local thermodynamic equilibrium and write 

de _ de dn de dT 
at ~ dn 9t + df Ik 



H dp dT 

e at +Cy at 



+ c v — , (1.195) 



where n is the electron number density (n = —p/e) and c v is the specific heat. We may 
now write 



dT de : p dp 

ft 



Cy dt dt + e dt 



j-E-Vj £ -^Vj 



j-S-Vj q . (1.196) 



'Note that it is E ■ j and not £ ■ j which is the source term in the energy continuity equation. 



1.8. THERMAL TRANSPORT 



41 



Invoking j q = nj — kV T, we see that if there is no electrical current (j 
the heat equation 

dT _ d^T 

Cy ~dt~ Kafi dx^dxP ■ 

This results in a time scale t t for temperature diffusion r T = CL 2 c v /k, where L is a typical 
length scale and C is a numerical constant. For a cube of size L subjected to a sudden 
external temperature change, L is the side length and C = 1/3-ir 2 (solve by separation of 
variables). 



= 0), we obtain 
(1.197) 



1.8.3 Calculation of Transport Coefficients 



We will henceforth assume that sufficient crystalline symmetry exists (e.g. cubic symme- 
try) to render all the transport coefficients multiples of the identity matrix. Under such 
conditions, we may write Jn^ = Jn 5 a /3 with 



The low-temperature behavior is extracted using the Sommerfeld expansion, 

oo 

1= jdeH(e) ^- 



de 



ttV csc(ttD) H(e) 



TT 



H( f ,) + -(k B T) 2 H"(i,) + ... 



where T> = k B T ^ is a dimensionless differential operator. 
To quickly derive the Sommerfeld expansion, note that 



12 



df° 



l 



l 



de J k B T [ e ie-n)/k B T + j] ^-e)/k B T + jj ' 

hence, changing variables to x = (e — fi)/k B T, 



J (e* + l)(e- x + 1) J 



(e x + 

— oo 

oo 

2ttz ^2 ^ es 



n=0 



(e x + \)(e~ x + 1) 



(e x + l)(e~ x + 1 
H{e) 



H{e) 



x=(2n+l)iir 



e=fi 



(1.198) 



(1.199) 
(1.200) 



(1.201) 



(1.202) 



Remember that physically the fixed quantities are temperature and total carrier number density (or 
charge density, in the case of electron and hole bands), and not temperature and chemical potential. An 
equation of state relating n, /j,, and T is then inverted to obtain /j,(n,T), so that all results ultimately may 
be expressed in terms of n and T. 
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where we treat T> as if it were c-number even though it is a differential operator. We have 
also closed the integration contour along a half-circle of infinite radius, enclosing poles in 
the upper half plane at x = (2n + l)iir for all nonnegative integers n. To compute the 
residue, set x = (2n + + e, and examine 



e (2n+l)M-X> & eV 
(l_ e e )(1 _ e -e) 



1 + eV + \e 2 V 2 + 
e 2 + ie 4 + ... 



3 (2n+l)i7rX> 



= {-^-7 + (i^-^ 2 ) + ^ 



= (2n+l)i7rX> 



(1.203) 



We conclude that the residue is —V e ( 2n + 1 ) m ' D _ Therefore, 

oo 

1 = -2iriV ^ e {2n+1)inV H{e) 

?1=0 

= irVcsc(itV) H(e) 



e=/i 



(1.204) 



which is what we set out to show. 



Let us now perform some explicit calculations in the case of a parabolic band with an 
energy-independent scattering time r. In this case, one readily finds 



J n = ^ ^-3/2 ^ cgc nV £ 3/2 (£ _ 



e=/i 



where a = ne 2 r/m*. Thus, 



Jo 
Ji 

J,. 



1 vr 2 (k B T) 2 

1 + — v V +... 



8 /i 2 



a vr 2 (fc B T)< 



s 2 2 



+ . . • 



from which we obtain the low-T results p = <7 x , 



vr 2 fc|T 



7T 2 TIT 

3 m* 



and of course n = TQ. The predicted universal ratio 

4- = ^ (k B /e) 2 = 2.45 x 10~ 8 V 2 KT 2 



(1.205) 

(1.206) 
(1.207) 
(1.208) 

(1.209) 

(1.210) 



is known as the Wiedemann- Franz law. Note also that our result for the thermopower 
is unambiguously negative. In actuality, several nearly free electron metals have positive 
low-temperature thermopowers (Cs and Li, for example). What went wrong? We have 
neglected electron-phonon scattering! 
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TEMPERATURE IN DEGREES KELVIN 

Figure 1.13: QT product for p-type and n-type Ge, from T. H. Geballe and J. W. Hull, 
Phys. Rev. 94, 1134 (1954). Samples 7, 9, E, and F are distinguished by different doping 
properties, or by their resistivities at T = 300 K: 21.5f2-cm (7), 34.5 fi-cm (9), 18.5 H-cm 
(E), and 46.0^-cm (F). 



1.8.4 Onsager Relations 

Transport phenomena are described in general by a set of linear relations, 

Ji = T-'ik Fk i 



(1.211) 



where the {F k } are generalized forces and the {Ji} are generalized currents. Moreover, 
to each force Fi corresponds a unique conjugate current Ji, such that the rate of internal 
entropy production is 

s »/, = • ()s 



Fi = 



dJ* 



(1.212) 



The Onsager relations (also known as Onsager reciprocity) states that 

L ik {B) = rji7] k L ki (-B) , 
where r/i describes the parity of Ji under time reversal: 

T ' J% — TJi Ji . 

We shall not prove the Onsager relations. 



(1.213) 



(1.214) 
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The Onsager relations have some remarkable consequences. For example, they require, for 
B = 0, that the thermal conductivity tensor k - of any crystal must be symmetric, indepen- 
dent of the crystal structure. In general, this result does not follow from considerations of 
crystalline symmetry. It also requires that for every 'off-diagonal' transport phenomenon, 
e.g. the Seebeck effect, there exists a distinct corresponding phenomenon, e.g. the Peltier 
effect. 

For the transport coefficients studied, Onsager reciprocity means that in the presence of an 
external magnetic field, 

Pa P ( B ) = Ppa(- B ) (1-215) 

\tf (*) = n pa {-B) (1.216) 
n ap (B) = TQp a (-B) . (1.217) 

Let's consider an isotropic system in a weak magnetic field, and expand the transport 
coefficients to first order in B: 

P a ^B) = p5 ap + ue aPl B^ (1.218) 

^{B) = K5 ap + we aPl B^ (1.219) 

Q aP (B) = Q8 af} + (e af3j B^ (1.220) 

^B) = U5 a0 + 9e a ^ . (1.221) 
Onsager reciprocity requires H = T Q and = T(. We can now write 

£ = pj + vj x B + QVT + CVT x B (1.222) 
j q = Hj + 0j x B - kVT -mVT x B . (1.223) 

There are several new phenomena lurking! 

. Hall Effect (f = f = j y = 0) 

An electrical current j = j x x and a field B = B z z yield an electric field £ . The Hall 
coefficient is i?n = £y/jx B z = —v. 

• Ettingshausen Effect = j y = j q , y = 0) 

An electrical current j = j x x and a field B = B z z yield a temperature gradient ^ . 



The Ettingshausen coefficient is P = ^/j x B z = —0/n. 

dy 



Nernst Effect (j x = j v = % = 0) 



A temperature gradient VT = ^ x and a field B = B z z yield an electric field £. 
The Nernst coefficient is A = £ y j% B z = -(. 

• Righi-Leduc Effect (j x = j y = £ y = 0) 

A temperature gradient V T = ^ x and a field B = B z z yield an orthogonal tem- 
perature gradient The Righi-Leduc coefficient is C = ^/'t^B z = (/Q. 
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1.9 Electron-Phonon Scattering 



1.9.1 Introductory Remarks 



We begin our discussion by recalling some elementary facts about phonons in solids: 



• In a crystal with r atoms per unit cell, there are 3(r — 1) optical modes and 3 acoustic 
modes, the latter guaranteed by the breaking of the three generators of space transla- 
tions. We write the phonon dispersion as cj = uj x (q), where A £ {1, . . . , 3r} labels the 
phonon branch, and q 6 Cl. If j labels an acoustic mode, uij{q) = Cj{q) q as q — ► 0. 

• Phonons are bosonic particles with zero chemical potential. The equilibrium phonon 
distribution is 

n° = 1 . (1.224) 

qX exp(^ A (g)A B T)-l 

• The maximum phonon frequency is roughly given by the Debye frequency u> D . The 
Debye temperature D = huo u ~ 100 K - 1000 K in most solids. 



At high temperatures, equipartition gives ((5Ri) 2 ) oc k B T, hence the effective scattering 
cross-section a tot increases as T, and r |> l/n ion v F a tot oc T -1 . From p = m*/ne 2 T, then, 
we deduce that the high temperature resistivity should be linear in temperature due to 
phonon scattering: p(T) oc T. Of course, when the mean free path i = v f t becomes as 
small as the Fermi wavelength A F , the entire notion of coherent quasiparticle transport 
becomes problematic, and rather than continuing to grow we expect that the resistivity 
should saturate: p(T — ► oo) h/k F e 2 , known as the Ioffe-Regel limit. For k F = 10 8 cm _1 , 
this takes the value 260 pVL cm. 



1.9.2 Electron-Phonon Interaction 

Let i?j = i2° + <5i2j denote the position of the i th ion, and let U(r) = — Ze 2 exp(— r/A TF )/r 
be the electron-ion interaction. Expanding in terms of the ionic displacements 6R4, 

H cl _ ion = Y^U(r- fl?) - SR i- V U ( r ~ > ( L225 ) 

i i 

where i runs from 1 to A^i on 13 . The deviation SR4 may be expanded in terms of the vibra- 
tional normal modes of the lattice, i.e. the phonons, as 




(1.226) 



We assume a Bravais lattice, for simplicity. 
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<- 



transverse 



longitudinal 



Figure 1.14: Transverse and longitudinal phonon polarizations. Transverse phonons do 
not result in charge accumulation. Longitudinal phonons create local charge buildup and 
therefore couple to electronic excitations via the Coulomb interaction. 



The phonon polarization vectors satisfy e x (q) = e* x (—q) as well as the generalized orthonor- 
mality relations 



a 

Y j e a x {q)e{{-q)=M-H a ^ 



(1.227) 
(1.228) 



where M is the ionic mass. The number of unit cells in the crystal is iVi OI1 = V/Q, where 
Q is the Wigner-Seitz cell volume. Again, we approximate Bloch states by plane waves 
4>k( r ) = exp(ik ■ r)/VV, in which case 



(k'\VU(r-R° i )\k) = -^ e^- k '>^ ^ {k - * 



(k — k f ^ -\- Arpp 



The sum over lattice sites gives 



A ? io 



e H ' 1 — iV ion Ok',k+q mod G , 



i=l 



so that 




(1.229) 



(1.230) 



(1.231) 
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with 

/ X 1 / 2 2 

, A (»,* +fl+0) = - j ^_A^) _^_ g (<+g) .,.(,). (1 . 232 , 

In an isotropic solid 14 ('jellium'), the phonon polarization at wavevector q either is parallel 
to q (longitudinal waves), or perpendicular to q (transverse waves). We see that only 
longitudinal waves couple to the electrons. This is because transverse waves do not result 
in any local accumulation of charge density, and it is to the charge density that electrons 
couple, via the Coulomb interaction. 



Restricting our attention to the longitudinal phonon, we have e L (q) = q/vM and hence, 
for small q = k! — k, 

^ k ^-i^T0?^'' w/2 - (L233) 

where c L is the longitudinal phonon velocity. Thus, for small q we that the electron- 
longitudinal phonon coupling g h (k, k + q) = g q satisfies 

\ 9q \ 2 = Ad-ph • ^ , (1-234) 

where g(e F ) is the electronic density of states, and where the dimensionless electron-phonon 
coupling constant is 



2Mcin g (e F ) 



B^S 



with S = hc L k F /k B . Table 1.3 lists S , the Debye temperature G D , and the electron-phonon 
coupling A c i_ph for various metals. 

EXERCISE: Derive eqn. (1.235). 



Metal 


e s 


e D 


Ael-ph 


Metal 


e s 


e D 


Ael-ph 


Na 


220 


150 


0.47 


Au 


310 


170 


0.08 


K 


150 


100 


0.25 


Be 


1940 


1000 


0.59 


Cu 


490 


315 


0.16 


Al 


910 


394 


0.90 


Ag 


340 


215 


0.12 


In 


300 


129 


1.05 



Table 1.3: Electron-phonon interaction parameters for some metals. Temperatures are in 
Kelvins. 



The jellium model ignores G / Umklapp processes. 
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1.9.3 Boltzmann Equation for Electron-Phonon Scattering 



Earlier we had quoted the result for the electron-phonon collision integral, 

Zk{f,n} = r^^\g x {k,k')\ 2 {(l- f k )f kl (l + n qyX )5{e k + huj qX -e kl ) 

k',X 

+ (1 - fk) fk> n- q \5(e k - hj- q x - e k >) 
-fk (1 - fk') (1 + n- q x) 5(e k - fiw-gx - e k >) 

~fk (1 - fk') n q\ K £ k + foWq\ ~ £fc') j^fc'-fc mod G ■ 



(1.236) 



The four terms inside the curly brackets correspond, respectively, to cases (a) through (d) 
in fig. 1.1. The (1 + n) factors in the phonon emission terms arise from both spontaneous 
as well as stimulated emission processes. There is no spontaneous absorption. 

EXERCISE: Verify that in equilibrium l k {f°, n } = 0. 

In principle we should also write down a Boltzmann equation for the phonon distribution 
n qX and solve the two coupled sets of equations. The electronic contribution to the phonon 
collision integral is written as J q x{f,n}, with 



J q x{f,n}= ( 



dn 



q\ 



at 



coll 



47T 

w 



%J 2 E{( 1 + %a)Wi-4) 

ken 

~ %x fk (1 " 4 +9 )} x S(e k+q -e k - huiqx) . (1.237) 



Here, we will assume that the phonons are always in equilibrium, and take n qX = n qX . 
Phonon equilibrium can be achieved via anharmonic effects (i.e. phonon-phonon scattering), 
or by scattering of phonons from impurities or crystalline defects. At low temperatures, 



Alo 2 impurity scattering 

B lo 2 T 3 anharmonic phonon scattering 

CjL boundary scattering (L = crystal size) 



(1.238) 



where A, B, and C are constants. 
We now linearize I k {f}, and obtain 
2vr 



qX 



C6 f=W^ 9qX n ^-fk + <^ 6 fk + q-(fk + q + <xW k 5{e k+q -e k -f^ q x) 



- [(1 - fk+q + n°_ qX )6f k - {fk + n\xW k+ q] ^k +q - + t^-qx)} • (1-239) 



This integral operator must be inverted in order to solve for 5f k in 

or 



C5f = ev • £ 



de 



(1.240) 
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Unfortunately, the inversion is analytically intractable - there is no simple solution of the 
form 5fk = £T~ k v k ■ £ (df°/de) as there was in the case of isotropic impurity scattering. 
However, we can still identify the coefficient of — dfk in C5f as the scattering rate t^ 1 . As 
before, r fc in fact is a function of the energy e(k): 

+ [1 + /»(£') + 4_ k ] S(e' - e + Hw k _„,)} (1.241) 

In an isotropic system, r(e(fc)) is independent of k. This means we can take k = y^2m*e/h 2 z 
in performing the above integral. 

It is convenient to define the dimensionless function 

" 2F( -» S J* J" 8 *' - "•>-> • (L2 " 2) 

For parabolic bands, one obtains 

V ; STrWm*^/^ hk F F J K ' ' ; 

—J 0(2A; B e s - fiw) . (1.243) 
The scattering rate is given in terms of a 2 F(uj) as 

oo 

-^y = 2^ Jdua 2 F(w){f(e + hu)-f(e-hu) + 2n (<j) + l}. (1.244) 
o 

At T = we have /°(e) = 9(e p — e) and n°(uj) = 0, whence 

oo 

-^y = 2vr Jdu;a 2 F(uj) |e(e F - e - huo) - 9(e p - e + fro;) + 1 j 



^el-ph 27T l £ £ f! 3 if | c _ c \ ^ Oh ft 

12 a (fc B e s ) 2 11 l £ ep l ^ Zft B^s 
f • (fc B e s ) it |e - e p | > 2k B Q s . 



(1.245) 



Note that r(e F ) = oo, unlike the case of impurity scattering. This is because at T = there 
are no phonons! For the divergence is cut off, and one obtains 



G{y) = Jdx; 



( 7 

V 2 
2T 



2 sinh x 

o 



4 C(3) if V = oo 

(1.247) 

\y if y < i , 
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and so 

— 2ft @2- A c l_ph II J <5k u s 

(1.248) 

ffe B TA c i_ ph ifT»e s 

This calculation predicts that r oc T~ 3 at low temperatures. This is correct if r is the 
thermal lifetime. However, a more sophisticated calculation shows that the transport lifetime 
behaves as r tr oc T -5 at low T. The origin of the discrepancy is our neglect of the (1 — cos i?) 
factor present in the average of the momentum relaxation time. At low T, there is only 
small angle scattering from the phonons, and (t? 2 ) oc (q 2 /k 2 } oc T 2 . The Wiedemann- Franz 
law, r a = t k , is valid for k B T >hc L k F , as well as at low T in isotropic systems, where 
impurity scattering is the dominant mechanism. It fails at intermediate temperatures. 



1 



Chapter 2 



Mesoscopia 



2.1 References 

• Y. Imry, Introduction to Mesoscopic Physics 



• M. Janssen, Fluctuations and Localization 

• D. Ferry and S. M. Goodnick, Transport in Nanostructures 



2.2 Introduction 

Current nanofabrication technology affords us the remarkable opportunity to study con- 
densed matter systems on an unprecedented small scale. For example, small electron boxes 
known as quantum dots have been fabricated, with characteristic size ranging from 10 nm 
to 1 fim; the smallest quantum dots can hold as few as one single electron, while larger 
dots can hold thousands. In systems such as these, one can probe discrete energy level 
spectra associated with quantization in a finite volume. Oftentimes systems are so small 
that Bloch's theorem and the theoretical apparatus of Boltzmann transport are of dubious 
utility. 



2.3 The Landauer Formula 

Consider a disordered one-dimensional wire connected on each end to reservoirs at fixed 
chemical potential /x L and For the moment, let us consider only a single spin species, 
or imagine that the spins are completely polarized. Suppose further that fi h > fi K , so that 
a current I flows from the left reservoir (L) to the right reservoir (R). Next, consider a 
cross-sectional surface £ just to the right of the disordered region. We calculate the current 
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flowing past this surface as a sum over three terms: 

h = -e JdeN{e)v{e)[T{e)f{e-^) + R!{e)f{e-^)-f{e-^)} ■ (2.1) 

Here, Af(e) is the density of states in the leads per spin degree of freedom, and corresponding 
to motion in a given direction (right or left but not both); v{e) is the velocity, and f(e— Ml,r) 
are the respective Fermi distributions. T(e) is the transmission probability that electrons of 
energy e emerging from the left reservoir will end up in right reservoir; R'(e) is the reflection 
probability that electrons emerging from right reservoir will return to the right reservoir. 
The three terms on the right hand side of (2.1) correspond, respectively, to: (i) electrons 
emerging from L which make it through the wire and are deposited in R, (ii) electrons 
emerging from R which fail to 'swim upstream' to L and are instead reflected back into 
R, and (iii) all electrons emerging from reservoir R (note this contribution is of opposite 
sign). The transmission and reflection probabilities are obtained by solving for the quantum 
mechanical scattering due to the disordered region. If the incoming flux amplitudes from 
the left and right sides are i and i', respectively, and the outgoing flux amplitudes on those 
sides d and o, linearity of the Schrodinger equation requires that 




The matrix S is known as the scattering matrix (or 5-matrix, for short). The S*-matrix ele- 
ments r, t, etc. are reflection and transmission amplitudes. The reflection and transmission 
probabilities are given by 

R=\r\ 2 T' = \t'\ 2 (2.3) 

T = \t\ 2 Rl = \r'\ 2 . (2.4) 



Going back to (2.1), let us assume that we are close to equilibrium, so the difference /x R — fi L 
in chemical potentials is slight. We may then expand 



/(e - Mr) = /(e - Ml) + /'(e - Ml) (Ml - Mr) + ■ ■ ■ 



(2.5) 



and obtain the result 



(Mr-Ml) JdeM{e)v{ 
^Mr - Ml) 



- M-f ) no 



(2.6) 



valid to lowest order in (/% — /j> l ). We have invoked here a very simple, very important 
result for the one-dimensional density of states. Considering only states moving in a definite 
direction (left or right) and with a definite spin polarization (up or down), we have 



M(e) de 



dk 
2^ 



1 dk 
2^de 



1 



hv(e) 



(2.7) 
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where h = 2ttH is Planck's constant. Thus, there is a remarkable cancellation in the product 
M(e) v(e) = h' 1 . Working at T = 0, we therefore obtain 

1= fan-^)T(e F ) , (2.8) 

where T(e F ) is the transmission probability at the Fermi energy. The chemical potential 
varies with voltage according to fJ,(V) = /x(0) — eV, hence the conductance G = I/V is 
found to be 

e 2 

G = — T(e F ) (per spin channel) (2.9) 

2e 2 

= T(e F ) (spin degeneracy included) (2.10) 

The quantity h/e 2 is a conveniently measurable 25, 813 Cl. 

We conclude that conductance is transmission - G\se 2 /h times the transmission probability 
T(e F ) with which an electron at the Fermi level passes through the wire. This has a certain 
intuitive appeal, since clearly if T(e F ) = we should expect G = 0. However, two obvious 
concerns should be addressed: 



• The power dissipated should be P = I 2 R = V 2 G. Yet the scattering in the wire is 
assumed to be purely elastic. Hence no dissipation occurs within the wire at all, and 
the Poynting vector immediately outside the wire must vanish. What, then, is the 
source of the dissipation? 

• For a perfect wire, T(e F ) = 1, and G = e 2 /h (per spin) is finite. Shouldn't a perfect 
(i.e. not disordered) wire have zero resistance, and hence infinite conductance? 



The answer to the first of these riddles is simple - all the dissipation takes place in the R 
reservoir. When an electron makes it through the wire from L to R, it deposits its excess 
energy ^ L — [i R in the R reservoir. The mechanism by which this is done is not our concern 
- we only need assume that there is some inelastic process (e.g. electron-phonon scattering, 
electron-electron scattering, etc.) which acts to equilibrate the R reservoir. 

The second riddle is a bit more subtle. One solution is to associate the resistance h/e 2 of a 
perfect wire with the contact resistance due to the leads. The intrinsic conductance of the 
wire G\ is determined by assuming the wire resistance and contact resistances are in series: 

e h 1 - T{e T ) h fl ( £p ) 

where G{ is the intrinsic conductance of the wire, per spin channel. Now we see that when 
T(e F ) — > 1 the intrinsic conductance diverges: G\ — > oo. When T <C 1, G\ ~ G = (e 2 /h)T. 
This result (2.11) is known as the Landauer Formula. 



To derive this result in a more systematic way, let us assume that the disordered segment 
is connected to the left and right reservoirs by perfect leads, and that the leads are not in 
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equilibrium at chemical potentials fi h and fj, R but instead at j2 L and Ar- To determine Al 
and jl R , we compute the number density (per spin channel) in the leads, 

n L = JdeM(e){[l + R(e)]f(e-^) + T'(e)f(e-fi R )} (2.12) 

n R = jdeM{e)[T{e)f{s - Ml ) + [l + R'(e)] f(e - Mr )} (2.13) 

and associate these densities with chemical potentials Al and Ar according to 

Or -AO (2.14) 

n R = 2 JdeM(e)f(e-ji R ) , (2.15) 



:JdeM(e)f(e 



where the factor of two accounts for both directions of motion. To lowest order, then, we 
obtain 

2 (Ml - Al) = (Ml ~ Mr) t ' Ml = Ml + | (/i R - /x L ) (2.16) 

2(Mr - Ar) = (Mr - Ml) T =► Mr = Mr + 5 T (Ml ~ Mr) (2-17) 

and therefore 

(Ml " Mr) = (1 - |T - iT') (/x L - Mr ) 

= (1 - T) (/i L - /. R ) , (2.18) 

where the last equality follows from unitarity (<StS = <S<St = 1). There are two experimental 
configurations to consider: 

• Two probe measurement - Here the current leads are also used as voltage leads. The 
voltage difference is AV = (fi R — Ml)/ c an d the measured conductance is G 2 piohc = 

(e 2 /h)T(e F ). 

• Four probe measurement - Separate leads are used for current and voltage probes. 
The observed voltage difference is AV = (Ar — Al)/ 6 an d the measured conductance 
*G 4 _ prohe = (eyh)T(e F )/R(e F ). 



2.3.1 Example: Potential Step 

Perhaps the simplest scattering problem is one-dimensional scattering from a potential step, 
V(x) = Vq Q(x). The potential is piecewise constant, hence the wavefunction is piecewise a 
plane wave: 

x<0: i/>(x) = I e ikx + O' e~ ikx (2.19) 
x>0: i/>(x) = Oe ik ' x + l'e- ik ' x , (2.20) 
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V = E 



v=v 



v=o 



Figure 2.1: Scattering at a potential step. 



with 



21./2 



+ ^0 



(2.21) 



ft 2 fc 2 _ 
2m 2m 

The requirement that ip(x) and its derivative Y>'(x) be continuous at x = gives us two 
equations which relate the four wavefunction amplitudes: 

I + 0' = + l' (2.22) 
k(I - O') = k'{0 - I') . (2.23) 

As emphasized earlier, the 5-matrix acts on flux amplitudes. We have 



(2.24) 



with v = hk/m and v' = hk' /m. One easily finds the S-matrix, defined in eqn. 2.2, is given 
by 



2yS 
1+e 



1-6 

.1+e 



1+e 
1+e 



(2.25) 



where e = v'/v = k'/k = yl - ^, where E = e p is the Fermi energy. The two- and 
four-terminal conductances are then given by 

e 2 . ,, e 2 4e 

- W » , - - 



G 



2— probe 



h (1 



G 



4— probe 



e 2 \tf 
h Irl 2 



4e 



/i (1 



(2.26) 
(2.27) 



Both are maximized when the transmission probability T = \t\ 2 = 1 is largest, which occurs 
for e = 1, i.e. k! = k. 
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k'/k 

Figure 2.2: Dimensionless two-terminal conductance g versus k'/k for the potential step. 
The conductance is maximized when k! = k. 



2.4 Multichannel Systems 

The single channel scenario described above is obtained as a limit of a more general mul- 
tichannel case, in which there are transverse degrees of freedom (due e.g. to finite cross- 
sectional area of the wire) as well. We will identify the transverse states by labels i. Within 
the perfect leads, the longitudinal and transverse energies are decoupled, and we may write 

e = £±i +e ll (k) , (2.28) 

where £n(k) is the one-dimensional dispersion due to motion along the wire (e.g. £»(k) = 

h 2 k 2 /2m* , £n(k) = — 2icos ka, etc.). k is the component of the wavevector along the axis of 

the wire. We assume that the transverse dimensions are finite, so fixing the Fermi energy e F 
in turn fixes the total number of transverse channels, N c , which contribute to the transport: 

N c (e) = ^ 6(e - e ± .) (continuum) (2.29) 

i 

= ^2e(2t-\e-e ±i \) (tight binding) . (2.30) 

Equivalently, an electron with energy e in transverse state i has wavevector fej which satisfies 



£ \\( k i) = £ - £ u 



(2.31) 
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N c is the number of real positive roots of (2.31). Typically iV c m kp 1 A, where A is the 
cross-sectional area and k F is the Fermi wavevector. The velocity Vi is 



1 de 
hdk 



1 de {{ (k) 
h dk 



(2.32) 



k=k- 



The density of states Mi{e) (per unit spin, per direction) for electrons in the i th transverse 
channel is 

n 

so once again we have for the product h Vi(s) Mi(e) = 1. 



27r de„ 



(2.33) 



Consider now a section of disordered material connected to perfect leads on the left and 
right. The solution to the Schrodinger equation on either side of the disordered region is 



^ eft (x ± ,z) = J2{l j ^ z + 0' i e- ik ^ 

3=1 

^rightK' = E {°a ^ kaZ + I'a Z- lkaZ } tfixj ■ 
a=l 



(2.34) 



(2.35) 



Here, we have assumed a general situation in which the number of transverse channels N^' R 
may differ between the left and right lead. The quantities {/•, O'j, O a ,I' a } are wave function 
amplitudes. The iS-matrix, on the other hand, acts on flux amplitudes {i -, o'j, o a , i' a }, which 
are related to the wavefunction amplitudes as follows: 



The S-matrix is a (7V C R + 7V C L ) x (iV c R + iV c L ) matrix, 

S 



> t N R x N l r AT R x AT R , 



(2.36) 
(2.37) 

(2.38) 



which relates outgoing and incoming flux amplitudes: 

S 



r t' 
t r' 



Unitarity of S means that S^S = SS^ = I, where 



r t! 
t r' 



5t 



(2.39) 



(2.40) 
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and hence unitarity says 



* I it i/* c 

^ik T jk ' Z ic Z jc "ij 
tak t*bk + r 'ac r 'b*c = Sab 
r ik l ak + l ic r ac = 



^"ki ^kj ' Z ci cj ®ij 
Z ka Z kb + r ca r cb — °ab 
r ki tka + T ci r ca = , 



(2.41) 
(2.42) 
(2.43) 



or, in matrix notation, 



ttf + r' r /f 
r i f + t' r /f 
t r f + r' i /f 



N^xNk 



t't r + r't t 



(2.44) 
(2.45) 
(2.46) 
(2.47) 



We define the probabilities 



Af c L 

R i = Yl r *ik 



k=l 



7? = £4* 



c=l 



-^a = ^1 ^afc ^afc 
fc=l 

i?' = V r'* r' 

- 1 u a / ., ' ac ' ac 
c=l 



(2.48) 
(2.49) 



for which it follows that 



iL + 2? = 1 



< + r„ = i 



(2.50) 



for all i G {1, ... , N£} and a £ {1, . . . , N™}- Unitarity of the 5-matrix preserves particle 
flux: 



U2_U'|2 = | |2 , /|2 



(2.51) 



which is shorthand for 



jV c L iV c R Af c R 

EN/+EKi 2 = EH 2 +EKi 2 - 

j=l a=l a=l j=l 



(2.52) 



Onsager reciprocity demands that S(—H) = S t (H). 
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Let us now compute the current in the right lead flowing past the imaginary surface E 

Ta(e) 



L 



E deM a {e)v a {e)\ EM^I* /( e " ^ 

a=l J I i=l 



K(e) 



+ 



iV R 

EK^)! 2 - 1 



6=1 



f(e ~ Mr) 



= H(*-^/*(-^)g r -W- 



Thus, the result of a two-probe measurement would be 

el 



e~ I , I df° 



G 2 _ probc = — = j ids i i E w • 

Mr - Ml \ ° £ J a=1 



JVi" 



At zero temperature, then, 



where 



G 2 _ probc = j Tr tt t 



7V C L 7V R 

Trttt = Trttt = X;El*«i| 2 

i=l a=l 



(2.53) 



(2.54) 



(2.55) 



(2.56) 



To determine G 4 _ probe , we must compute the effective chemical potentials /x L and /2 R in the 
leads. We again do this by equating expressions for the electron number density. In the left 
lead, 



E fdeN^Ul + R t {e)]f{e - Ml) + ^(e) f(e - Mr ) 
i=i 

2^ |&AA t ( £ ) f{e-K) 



Ml = Ml - 2 T ' (Lt L - H R ) 



where T 1 is a weighted average, 



Y-vT x T' 



T = 



(2.57) 

(2.58) 
(2.59) 

(2.60) 
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Similarly, one obtains for the right lead, 

N R 

n R = E jdeN a {e)\Ta{e) f(e - Ml ) + [l + R' a (s)] f(e - /i R )| (2.61) 

a=l J (■ J 

= 2^ fdeN a {e)f{e-~^) (2.62) 

=> Ar = Mr. + \T - n R ) (2.63) 

where 

T = ^ a a a . (2.64) 

(We have assumed zero temperature throughout.) The difference in lead chemical potentials 
is thus 

(Al - Ar) = (l - |T - ^T 7 ) • - Mr) • (2-65) 
Hence, we obtain the 4-probe conductance, 



e 2 J2 T a 



2.4.1 Transfer Matrices: The Pichard Formula 

The transfer matrix 5 acts on incoming flux amplitudes to give outgoing flux amplitudes. 
This linear relation may be recast as one which instead relates flux amplitudes in the right 
lead to those in the left lead, i.e. 

x M 



o<\ Jr l'\ [i\ fo\ _(M U M 12 

21 



t r'J V t'J ' U'J [Mni MnJ V°" 



M. is known as the transfer matrix. Note that each of the blocks of M. is of dimension 
x N£, and M. itself is a rectangular 2iV ( f x matrix. The individual blocks of .M 
are readily determined: 

o' = ri + t'i' => i = -t'^ri + t'- 1 d (2.68) 

o = ti + r'i / => o = (t-r't'~ 1 r)i + r't'- 1 o' , (2.69) 

so we conclude 

A/t u = t t_1 7W 12 = r't / - 1 (2.70) 

^21 = -^^ ^22 = * /_1 - (2-71) 

WARNING: None of this makes any sense if N£ 7^ iV,? / The reason is that it is prob- 
lematic to take the inverse of a rectangular matrix such as t or t' , as was blithely done 
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#1 



#2 



i 



Figure 2.3: Two quantum scatterers in series. The right side data for scatterer #1 become 
the left side data for scatterer #2. 



above in (2.68,2.69). We therefore must assume = N£ = 7V C , and that the scatterers are 
separated by identical perfect regions. Practically, this imposes no limitations at all, since 
the width of the perfect regions can be taken to be arbitrarily small. 

EXERCISE: Show that M n = t - r' t 1 ' 1 r = ft" 1 . 

The virtue of transfer matrices is that they are multiplicative. Consider, for example, two 
disordered regions connected by a region of perfect conductor. The outgoing flux o from 
the first region becomes the incoming flux i for the second, as depicted in fig. 2.3. Thus, 
if A4 t is the transfer matrix for scatterer and M 2 is the transfer matrix for scatterer 
#2, the transfer matrix for the two scatterers in succession is A4 = M 2 -^v 



M=M 2 M 1 




(2.72) 



Clearly, then, if we have many scatterers in succession, this result generalizes to 

M = M N M N _ 1 ---M l . (2.73) 

Unitarity of the 5-matrix means that the transfer matrix is pseudo-unitary in that it satisfies 

fl O \ 

M^EM = E where E = [q c * No _j cXNc ) ■ (2-74) 

V N c xN c N c xN c J 
This, in turn, implies conservation of the pseudo-norm, 



(2.75) 
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which is simply a restatement of (2.51). 
We now assert that 



M ] M + {M ] M)- 1 +2-1 
This result is in fact easily derived once one notes that 



1 _ i (th 

"no t'tt 



(2.76) 



-m\ 2 m\ 2 



(2.77) 



EXERCISE: Verify eqn. (2.76). 



The 2-probe conductance (per spin channel) may now be written in terms of the transfer 
matrix as 



2e z 

r - Tr 

LT 2-probe ~ 



M^M + iM^M)- 1 + 2-1 



(2.78) 



This is known as the Richard Formula. 



2.4.2 Discussion of the Pichard Formula 

It is convenient to work in an eigenbasis of the Hermitian matrix Ai^Ai. The eigenvalues 
of M^M are roots of the characteristic polynomial 



p(A) = det (A - M^M) . 

Owing to the pseudo-unit arity of At, we have 

p(A) = det(A-M t M) 

= det(A - £ AT 1 £ • £ M^-i S ) 

= det(A-£AT 1 .M t - 1 £) 

= A 2JVc det (A" 1 - /det^M) , 



(2.79) 



(2.80) 



from which we conclude that p(A) = implies p(X 1 ) = 0, and the eigenvalues of Aft Ai 
come in (A, A -1 ) pairs. We can therefore write Pichard's formula as 



,2 N * 



G. 



2— probe 



-Y 

h 4-i 



^A. + A^ + 2 



(2.81) 



where without loss of generality we assume \ > 1 for each i £ {1, . . . , ./V c }. We define the 
i tb localization length £j through 



,'2L\ 
Aj = exp ( — 1 



2L 

In A, ; 



(2.82) 
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where L is the length of the disordered region. We now have 



G 



2— probe 



e 
~h 



N c 



i=l 



+ cosh(2L/6) 



(2.83) 



If N c is finite, then as L — ► oo the converge to definite values, for a wide range 
of distributions P(M n ) for the individual scatterer transfer matrices. This follows from 
a version of the central limit theorem as applied to nonabelian multiplicative noise (i.e. 
products of random matrices), known as Oseledec's theorem. We may choose to order the 
eigenvalues such that A 1 < A 2 < • • • < A^ , and hence £ x > £ 2 > ■ ■ ■ > £, N ■ In the L — ► oo 
limit, then, the conductance is dominated by the largest localization length, and 

G(L) ~ — e~ 2L ^ (N c finite, L -> oo) . (2.84) 

(We have dropped the label '2-probe' on G.) The quantity £ = £i is called the localization 
length, and it is dependent on the (Fermi) energy: £ = 

Suppose now that L is finite, and furthermore that ^ > 2L > ^ . Channels for which 
2L <C £j give cosh(2L/£j) ~ 1, and therefore contribute a quantum of conductance e 2 /h 
to G. These channels are called open. Conversely, when 2L S> £j, we have cosh(2L/£j) ~ 
| exp(2L/£j) 3> 1, and these closed channels each contribute AGj = (2e 2 /h)e~ 2L /^ to the 
conductance, a negligible amount. Thus, 

e 2 ^ 
G(L) ~ ^ Af c opcn , 7V c open = 9(& - 2L) . (2.85) 

i=i 

Of course, 7V£ losed = N c — N° pcn , although there is no precise definition for open vs. closed 
for channels with £j ~ 2L. This discussion naturally leads us to the following classification 
scheme: 



• When L > £i, the system is in the localized regime. The conductance vanishes expo- 
nentially with L according to G(L) ~ (4e 2 //i) exp(— 2L/£), where £(e) = is the 
localization length. In the localized regime, there are no open channels: Nc pen = 0. 

• When N° pcn = £N C /L, where I is the elastic scattering length, one is in the Ohmic 
regime. In the Ohmic regime, for a d-dimensional system of length L and ((d — 1)- 
dimensional) cross-sectional area A, 

G o hmi c-xz^ lA = x^ l£ -z- (2 - 86) 

Note that G is proportional to the cross sectional area A and inversely proportional 
to the length L, which is the proper Ohmic behavior: G = a A/L, where 

a^—k d - x l (2.87) 

is the conductivity. 
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• When L < £ N , all the channels are open: iV° pen = N c . The conductance is 



G(L) = —N c ^— k^ L A . (2.88) 



This is the ballistic regime. 



If we keep N c oc (kpL)^ 1 , then for L — > oo Oseledec's theorem does not apply, because 
the transfer matrix is oo-dimensional. If ^(e) nonetheless remains finite, then G{L) ~ 
(4e 2 //i) exp(— L/£) — > and the system is in the localized regime. If, on the other hand, 
^i(s) diverges as L — > oo such that exp(L/^ 1 ) is finite, then G > and the system is a 
conductor. 

If we define v i = In X { , the dimensionless conductance g = (h/e 2 ) G is given by 

oo 


where 

<T{v) = Y i 8{v-v i ) (2.90) 

i=i 

is the density of v values. This distribution is normalized so that f^dv a(v) = N c . Spec- 
tral properties of the {v{\ thus determine the statistics of the conductance. For example, 
averaging over disorder realizations gives 



r (<r(v)) 

(ff) = 2 du - V V ' . (2.91) 
x ' J 1+coshz^ v ; 



uis — 



The average of g 2 , though, depends on the two-point correlation function, viz. 



oo oo 

a(v) a(v' 



f) = 4 \dv du' \, ' » • (2.92) 

1 J J (1 + cosh + cosh iX) v ; 





2.4.3 Two Quantum Resistors in Series 

Let us consider the case of two scatterers in series. For simplicity, we will assume that 
N c = 1, in which case the transfer matrix for a single scatterer may be written as 

-['//) ■ <»»> 

A pristine segment of wire of length L has a diagonal transfer matrix 
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where (5 = kL. Thus, the composite transfer matrix for two scatterers joined by a length L 
of pristine wire is M = M 2 NM 1 , i.e. 

In fact, the inclusion of the transfer matrix M is redundant; the phases e ±tf3 can be com- 
pletely absorbed via a redefinition of {t±, t^, r±, r[}. 



Extracting the upper left element of M gives 

1 e^-e-^rjVS 



t* t\t* 2 



hence the transmission coefficient T for the composite system is 



(2.96) 



T = TlT2 ; (2.97) 

1 + RiR 2 - 2^RJT 2 cos <5 

where 5 = 2/3 + arg(r' 1 r 2 ). The dimensionless Landauer resistance is then 

_ R _ Ri + R 2 - 2 v / i^R 2 cos 5 
~ T ~ T\T 2 

= K 1 + K 2 + 2K 1 K 2 -2v / K 1 K 2 (l + K 1 )(l+K 2 ) cos 5 . (2.98) 



If we average over the random phase 5, we obtain 

(K) = Tlx +Tl 2 + 2K 1 K 2 . (2.99) 

The first two terms correspond to Ohm's law. The final term is unfamiliar and leads to 
a divergence of resistivity as a function of length. To see this, imagine that that TZ 2 = 
gdL is small, and solve (2.99) iteratively. We then obtain a differential equation for the 
dimensionless resistance 1Z{L): 

dK=(l + 2K)gdL Ti{L) = \{e 2eL - I) . (2.100) 



In fact, the distribution P L (7Z) is extremely broad, and it is more appropriate to average 
the quantity ln(l + TV). Using 



2tt 



with 



^ ln(a-6cos<5) = ln(ja+ \\/a 2 -b 2 ^ (2.101) 



a=l + K 1 + K 2 + 2K 1 K 2 (2.102) 
b = 2y/KiK 2 (l +n l ){l+TZ 2 ) , (2.103) 
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we obtain the result 



We define the quantity 



and we observe 



(ln(l + 11)) = ln(l + Hi) + ln(l + TZ 2 ) . 
x{L)=\n{l + K{L)} , 

(x{L)) = Q L 



(2.104) 
(2.105) 

(2.106) 
(2.107) 



Note that (e x ) / e^ x) . The quantity x(L) is an appropriately self- averaging quantity in 
that its root mean square fluctuations are small compared to its average, i.e. it obeys the 
central limit theorem. On the other hand, 1Z(L) is not self-averaging, i.e. it is not normally 
distributed. 



Abelian Multiplicative Random Processes 

Let p(x) be a distribution on the nonnegative real numbers, normalized according to 



oo 

fdxp(x) = 1 , 



and define 



N 



N 



X; 



Y = InX = J^ln 



X; . 



1=1 



1=1 



The distribution for Y is 

oo oo 



oo oo oo / N 

MY) = ■ • • ■ ■ -p( x n) * (y - 5> 

n n n \ *— 1 





oo 



-/ 

— oo 
oo 

-/ 



2^ 



o 

'OO 



A? 



do; 
2^ 



1 - iw(lnx) - iw 2 (ln 2 x) + C(w 3 ) 



iV 



-oo 
oo 



with 



duJ iu,(Y-N(\nx)) ~\Nu> 2 ({\n 2 x)-{\nx) 2 )+0(uj' A ) 

H = (In a;) , o" 2 = (ln 2 x) — (lnx) 2 



(2.108) 



(2.109) 



(2.110) 
(2.111) 
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and 

oo 

(f(x)) = Jdxp(x)f(x) . (2.112) 



Thus, Y is normally distributed with mean (Y) = N[i and standard deviation ((Y — N fi) 2 } = 
No~ 2 . This is typical for extensive self- averaging quantities: the average is proportional to 
the size iV of the system, and the root mean square fluctuations are proportional to y/N. 
Since lim/v->oo Y rms / (Y) ~ a / s/N \i — > 0, we have that 

Pn^JX)-^~^v) ■ ( 2 - 113 ) 

This is the central limit theorem (CLT) at work. The quantity Y is a sum of independent 
random variables: Y = ^ In X{, and is therefore normally distributed with a mean Y = Nfi 
and standard deviation y/Na, as guaranteed by the CLT. On the other hand, X = exp(Y) 
is not normally distributed. Indeed, one readily computes the moments of X to be 

(X k ) = e kN »e Nk2 l 2u \ (2.114) 

hence 

j**> = e N Kk - m ^ (2 115) 

which increases exponentially with N. In particular, one finds 



{X) 



(e^ 2 -l) ' . (2.116) 



The multiplication of random transfer matrices is a more difficult problem to analyze, 
owing to its essential nonabelian nature. However, as we have seen in our analysis of series 
quantum resistors, a similar situation pertains: it is the logarithm ln(l + 1Z), and not the 
dimensionless resistance 1Z itself, which is an appropriate self-averaging quantity. 

2.4.4 Two Quantum Resistors in Parallel 

The case of parallel quantum resistors is more difficult than that of series resistors. The 
reason for this is that the conduction path for parallel resistances is multiply connected, i.e. 
electrons can get from start to finish by traveling through either resistor #1 or resistor #2. 

Consider electrons with wavevector k > moving along a line. The wavefunction is 

iP(x) = I e ikx + O' e- ikx , (2.117) 
hence the transfer matrix M. for a length L of pristine wire is 

/ ikL Q \ 

M£)= e -LJ • (2-118) 
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Now let's bend our wire of length L into a ring. We therefore identify the points x = and 
x = L = 2irR, where R is the radius. In order for the wavefunction to be single-valued we 
must have 

I 



M(L)-I\ = 0, (2.119) 

and in order to have a nontrivial solution (i.e. I and O' not both zero), we must demand 
det (AA — I) = 0, which says cos kL = 1, i.e. k = 2irn/L with integer n. The energy is then 
quantized: e n = eJk = 2im/L). 

Next, consider the influence of a vector potential on the transfer matrix. Let us assume the 
vector potential A along the direction of motion is nonzero over an interval from x = to 
x = d. The Hamiltonian is given by the Peierls substitution, 

H = e^{-id x + fcA(xj) . (2.120) 



Note that we can write 



H = A*(x)eJ-id x )A(x) (2.121) 



X 



le 



A(x) = exp I — J dx A(x) \ . (2.122) 
^ o ' 

Hence the solutions ip(x) to Ttip = sip are given by 

4>(x) = IA\x) e ikx + 0'A\x) e~ ikx . (2.123) 

The transfer matrix for a segment of length d is then 

/ ikd -i7 n \ 

M(d,A)=[ Q e e -ikd e -*r) (2-124) 

with 

d 

1 = Y C jdxA(x) . (2.125) 
o 



We are free to choose any gauge we like for A(x). The only constraint is that the gauge- 
invariant content, which is encoded in he magnetic fluxes through every closed loop C, 

$ c = j>A-dl , (2.126) 

c 

must be preserved. On a ring, there is one flux $ to speak of, and we define the dimensionless 
flux cj) = e<£ /he = 2"7t<I>/(/>q, where (f) Q = hc/e = 4.137x 10 -7 G-cm 2 is the Dirac flux quantum. 
In a field of B = 1 kG, a single Dirac quantum is enclosed by a ring of radius R = 0.11 /xm. 
It is convenient to choose a gauge in which A vanishes everywhere along our loop except for 
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Figure 2.4: Energy versus dimensionless magnetic flux for free electrons on a ring. The 
degeneracies are lifted in the presence of a crystalline potential. 

a vanishingly small region, in which all the accrued vector potential piles up in a 5- function 
of strength <£. The transfer matrix for this infinitesimal region is then 

M(4>)=^ J^=e*-I. (2.127) 

If k > corresponds to clockwise motion around the ring, then the phase accrued is —7, 
which explains the sign of (p in the above equation. 

For a pristine ring, then, combining the two transfer matrices gives 

K/i _(e ikL We** 0\_/e !ti e ! * \ 

\ e-* L )\0 e^)~\ e - ikL e^) ' ^ AM) 

and thus det(7V4 — I) = gives the solutions, 

kL = 2irn — (j) (right-movers) 
kL = 2im + 4> (left-movers) . 

Note that different n values are allowed for right- and left-moving branches since by assump- 
tion k > 0. We can simplify matters if we simply write tp(x) = Ae tkx with k unrestricted 
in sign, in which case k = {2im — <j))/L with n chosen from the entire set of integers. The 
allowed energies for free electrons are then 

^)~( n -^) 2 , (2.129) 

which are plotted in fig. 2.4. 
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#2 

Figure 2.5: Scattering problem for a ring enclosing a flux <]?. The square and triangular 
blocks represent scattering regions and are describes by 2 x 2 and 3x3 5-matrices, respec- 
tively. The dot-dash line represents a cut across which the phase information due to the 
enclosed flux is accrued discontinuously. 



Now let us add in some scatterers. This problem was first considered in a beautiful paper 
by Biittiker, Imry, and Azbel, Phys. Rev. A 30, 1982 (1984). Consider the ring geometry 
depicted in fig. 2.5. We want to compute the 5-matrix for the ring. We now know 
how to describe the individual quantum resistors #1 and #2 in terms of 5-matrices (or, 
equivalently, .M-matrices). Assuming there is no magnetic field penetrating the wire (or 
that the wire itself is infinitesimally thin), we have S = S for each scatterer. In this case, 
we have t = t' = yf " e ia . We know |r| 2 = |r'| 2 = 1 — |i| 2 , but in general r and r' may have 
different phases. The most general 2x2 transfer matrix, under conditions of time-reversal 
symmetry, depends on three parameters, which may be taken to be the overall transmission 
probability T and two phases: 

MiT ^^ = jf{^Te-^ e- ) • (2 - 130) 



We can include the effect of free-particle propagation in the transfer matrix M by multi- 
plying M on the left and the right by a free propagation transfer matrix of the form 

^=( e -m) , (2-131) 

where 9 = kL = 2-rrkR is the phase accrued by a particle of wavevector k freely propagating 
once around the ring. J\f is the transfer matrix corresponding to one quarter turn around 
the ring. One easily finds 

M -^NM(T,a,f3)N = M{T,a + \0,p). (2.132) 

For pedagogical reasons, we will explicitly account for the phases due to free propagation, 
and write 

AWA^fc) , (^WaW;>), ( , 13 3) 
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where M. 2 is the transfer matrix for scatterer #2 going from right to left. 



EXERCISE: Show that the right-to-left transfer matrix M is related to the left-to-right 
transfer matrix M. according to 



M = AEM+EA , 



where 



1 
-1 



A 



1 

1 



(2.134) 
(2.135) 



We now have to model the connections between the ring and the leads, which lie at the 
confluence of three segments. Accordingly, these regions are described by 3 x 3 5-matrices. 
The constraints S = (unitarity) and S = S l (time-reversal symmetry) reduce the number 
of independent real parameters in S from 18 to 5. Further assuming that the scattering 
is symmetric with respect to the ring branches brings this number down to 3, and finally 
assuming S is real reduces the dimension of the space of allowed 5-matrices to one. Under 
these conditions, the most general 3x3 5-matrix may be written 



where 



-(a + b) v/e 
\f~e a b 
\ft b a 



(a + b) 2 + 2e = 1 



a 2 + b 2 + e 



1 . 



(2.136) 



(2.137) 
(2.138) 



The parameter e, which may be taken as a measure of the coupling between the ring and 
the leads (e = means ring and leads are decouped) is restricted to the range < e < ^. 
There are four solutions for each allowed value of e: 



±|(vT=2i-l) 



and 



±|(vT=2i+l) 



b = ±i( v / T 3 2^ + l) 
6 = ±i(v / I 3 2^-l) 



(2.139) 



(2.140) 

We choose the first pair, since it corresponds to the case \b\ = 1 when e = 0, i.e. perfect 
transmission through the junction. We choose the top sign in (2.139). 



We therefore have at the left contact, 



a 2 



a L + b 



L ' U L) 

L 
L 




(2.141) 



and at the right contact 




f-(a R + b R ) 

\ V^R 









Or 


K, 


(1 


K 


Or ) 





(2.142) 
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e L = 0.100 , € R = 0.100 , T 4 = 0.300 , T B = 0.800 
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Figure 2.6: Two-probe conductance G(4>,n) of a model ring with two scatterers. The 
enclosed magnetic flux is cphc/e, and n = 2irkR (ring radius R). G vs. 4> curves for various 
values of k are shown. 

where accounting for the vector potential gives us 

/3 1 = e i *P 1 , a' 1 = e i<l, a' 1 . (2.143) 

We set i = 1 and i' = 0, so that the transmission and reflection amplitudes are obtained 
from t = o and r = d . 

From (2.141), we can derive the relation 

( a i\ 

Similarly, from (2.142), we have 



The matrices Q L and Q R resemble transfer matrices. However, they are not pseudo-unitary: 
Q^SQ ^ E. This is because some of the flux can leak out along the leads. Indeed, when 




(2.144) 




(2.145) 
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e, = 0.010 , e„ = 0.010 , T. = 0.300 , T„ = 0.800 
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Figure 2.7: Two-probe conductance G(<fi,n) of a model ring with two scatterers. The 
enclosed magnetic flux is (fthc/e, and n = 2irkR (ring radius R). G vs. (j) curves for various 
values of k are shown. The coupling between leads and ring is one tenth as great as in 
fig. 2.6, and accordingly the resonances are much narrower. Note that the resonances at 
k = 0.457T and k = 0.85-7T are almost completely suppressed. 



e = 0, we have 6 = 1 and a = 0, hence Q = 1, which is pseudo-unitary (i.e. flux preserving). 
Combining these results with those in (2.133), we obtain the solution 




(2.146) 



From this result, using (2.133), all the flux amplitudes can be obtained. 

We can define the effective ring transfer matrix V as 

V = e** Q^N M 2 M Q R M M 1 M , (2.147) 

which has the following simple interpretation. Reading from right to left, we first move \- 
turn clockwise around the ring (Af). Then we encounter scatterer #1 (M^. After another 
quarter turn (Af), we encounter the right T-junction (Q R ). Then it's yet another quarter 
turn (Af) until scatterer ^2 (A4 2 ), and one last quarter turn (Af) brings us to the left 
T-junction (Q L ), by which point we have completed one revolution. As the transfer matrix 
acts on both right-moving and left-moving flux amplitudes, it accounts for both clockwise 
as well as counterclockwise motion around the ring. The quantity 

{I - V}' 1 = 1 + V + V 2 + V 3 + . . . , (2.148) 

then sums up over all possible integer windings around the ring. In order to properly account 
for the effects of the ring, an infinite number of terms must be considered; these may be 
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Figure 2.8: Two-probe conductance G(4>,n) of a model ring with two scatterers. The 
enclosed magnetic flux is 4>hc/e, and k = 2nkR (ring radius R). G vs. n curves for various 
values of (f> are shown. 



resummed into the matrix inverse in (2.147). The situation is analogous to what happens 
when an electromagnetic wave reflects off a thin dielectric slab. At the top interface, the 
wave can reflect. However, it can also refract, entering the slab, where it may undergo an 
arbitrary number of internal reflections before exiting. 

The transmission coefficient t is just the outgoing flux amplitude: t = o. We have from 
(2.142,2.133) that 



t = (fa e* + P 2 ) 

-f (—0(3) 



(2.149) 



where 

y = Q^NM 2 NQ K -e- i<t} N' l M^N- 1 . (2.150) 

It is straightforward to numerically implement the above calculation. Sample results are 
shown in figs. 2.6 and 2.8. 
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2.5 Universal Conductance Fluctuations in Dirty Metals 

The conductance of a disordered metal is a function of the strength and location of the 
individual scatterers. We now ask, how does the conductance fluctuate when the position 
or strength of a scatterer or a group of scatterers is changed. From the experimental point 
of view, this seems a strange question to ask, since we generally do not have direct control 
over the position of individual scatterers within a bulk system. However, we can imagine 
changing some external parameter, such as the magnetic field B or the chemical potential fi 
(via the density n). Using computer modeling, we can even 'live the dream' of altering the 
position of a single scatterer to investigate its effect on the overall conductance. Naively, we 
would expect there to be very little difference in the conductance if we were to, say, vary the 
position of a single scatterer by a distance £, or if we were to change the magnetic field by 
AB = 4> /A, where A is the cross sectional area of the system. Remarkably, though, what 
is found both experimentally and numerically is that the conductance exhibits fluctuations 
with varying field B, chemical potential fi, or impurity configuration (in computer models). 
The root-mean-square magnitude of these fluctuations for a given sample is the same as 
that between different samples, and is on the order 5G ~ e 2 /h. These universal conductance 
fluctuations (UCF) are independent on the degree of disorder, the sample size, the spatial 
dimensions, so long as the inelastic mean free path (or phase breaking length) satisfies 
L^ > L, i.e. the system is mesoscopic. 

Theoretically the phenomenon of UCF has a firm basis in diagrammatic perturbation theory. 
Here we shall content ourselves with understanding the phenomenon on a more qualitative 
level, following the beautiful discussion of P. A. Lee in Physica 140A, 169 (1986). We begin 
with the multichannel Landauer formula, 



The transmission amplitudes t a j can be represented as a quantum mechanical sum over 
paths 7, 



where Aajij) is the probability amplitude for Feynman path 7 to connect channels j and a. 
The sum is over all such Feynman paths, and ultimately we must project onto a subspace of 
definite energy - this is, in Lee's own words, a 'heuristic argument'. Now assume that the 
■Aajil) are independent complex random variables. The fluctuations in \t a j\ 2 are computed 
from 




(2.151) 




(2.152) 



7 



<M 4 }= E (^(71)^72) ^(73)^(74)} 



71,72 
73.74 



= 2(EK'Wf) 2 + (EK.wi 4 ) 



7 7 




(2.153) 
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where M is the (extremely large) number of paths in the sum over 7. As the 0(M 1 ) term 
is utterly negligible, we conclude 

(M 4 )-(M 2 ) 2 = 
<M 2 > 2 

Now since 

var(G) = (G 2 ) - (G) 2 (2.155) 
/i 2 



= pE{(l^| 2 M'| 2 >-<l^| 2 Xl^f>}> (2-156) 



a, a' 



we also need to know about the correlation between \t a j\ 2 and |i a './'| 2 . The simplest assump- 
tion is to assume they are uncorrelated unless a = a' and j = f, i.e. 

(M 2 \t a >j'\ 2 ) ~ (\t aj \ 2 )(\t aj \ 2 ) = ((l^'l 4 >-<l^f> 2 )^^y , (2.157) 

in which case the conductance is a given by a sum of N 2 independent real random variables, 
each of which has a standard deviation equal to its mean, i.e. equal to (\t a j\ 2 )- According 
to the central limit theorem, then, the rms fluctuations of G are given by 

AG= ^var(G) = ^N c (\t aj \ 2 ) . (2.158) 

Further assuming that we are in the Ohmic regime where G = a L d ~ 2 , with a ~ (e 2 //i) kp~ 1 £, 
and N c « (A; F L) d_1 , we finally conclude 

<M 2 H^4 - AG-^i. (2.159) 

This result is much smaller than the correct value of AG ~ (e 2 /h). 
To reiterate the argument in terms of the dimensionless conductance g, 

9 = J2K\ 2 ^ N C -X =► <l^! 2 > = ^ (2-160) 



|2 



var( 5 ) = ^{(itajfltaYl 2 ) - (\t aj \ 2 )(\ta 

a, a 1 
3,3* 

«E{<m 4 Mm 2 > 2 } 

E(M 2 > 2 -^ 2 -(^) 2 = ^ (2.161) 

(2.162) 



c \m) iv c 2 

aj c c 



V^(5)~^ = 7 (WRONG!) 



2. 5. UNIVERSAL COND UCTANCE FL UCTUATIONS IN DIRTY METALS 



77 



What went wrong? The problem lies in the assumption that the contributions Aajil) are 
independent for different paths 7. The reason is that in disordered systems there are certain 
preferred channels within the bulk along which the conduction paths run. Different paths 
7 will often coincide along these channels. A crude analogy: whether you're driving from 
La Jolla to Burbank, or from El Cajon to Malibu, eventually you're going to get on the 405 
freeway - anyone driving from the San Diego area to the Los Angeles area must necessarily 
travel along one of a handful of high-volume paths. The same is not true of reflection, 
though! Those same two hypothetical drivers executing local out-and-back trips from home 
will in general travel along completely different, hence uncorrelated, routes. Accordingly, 
let us compute var(iV c — g), which is identical to var(g), but is given in terms of a sum over 
reflection coefficients. We will see that making the same assumptions as we did in the case 
of the transmission coefficients produces the desired result. We need only provide a sketch 
of the argument: 

i,3 c 

SO 

var(iV c - g) = £ {<K/ \r ilf ?) - (h/)^, .,| 2 )} 
jj' 

« £ {<></> - <M 2 > 2 } 

= £<h/> , »tf-&) , = 0-z)' < 2 - 1M > 



V 7 var(iV c - g) = y / vai{g) = (l - -) 



(2.165) 



The assumption of uncorrelated reflection paths is not as problematic as that of uncorrelated 
transmission paths. Again, this is due to the existence of preferred internal channels within 
the bulk, along which transmission occurs. In reflection, though there is no need to move 
along identical segments. 

There is another bonus to thinking about reflection versus transmission. Let's express the 
reflection probability as a sum over paths, viz. 



Ir.f 



EA 3 (7)4(7) • (2-166) 



7:7' 



Each path 7 will have a time-reversed mate 7 T for which, in the absence of external magnetic 
fields, 

A,(7) = A J -(7 T )- (2-167) 
This is because the action functional, 

*2 

S[r(t)] = Jdt^mr 2 - V(r) - ^A(r) ■ r} (2.168) 
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satisfies 

S[r(t)} = S[r(-t)} if B = . (2.169) 

There is, therefore, an extra negative contribution to the conductance G arising from phase 
coherence of time-reversed paths. In the presence of an external magnetic field, the path 
7 and its time-reversed mate 7 T have a relative phase rj = 47r$ 7 /0 o , where <I> 7 is the 
magnetic flux enclosed by the path 7. A magnetic field, then, tends to destroy the phase 
coherence between time-reversed paths, and hence we expect a positive magnetoconductance 
(i.e. negative magnetoresistance) in mesoscopic disordered metals. 



Conductance Fluctuations in Metallic Rings 

The conductance of a ring must be periodic under — ► + n<p for any integer n - rings 
with flux differing by an integer number of Dirac quanta are gauge-equivalent, provided no 
magnetic field penetrates the ring itself. The conductance as a function of the enclosed flux 
<E> must be of the form 

G($) = Gci + V G m cos ( + a m ) (2.170) 

where G c \ is the classical (Boltzmann) conductance of the ring. The second harmonic G m =2 
is usually detectable and is in many cases much larger than the m = 1 term. The origin 
of the m = 2 term, which is periodic under <E> — > + |</> , lies in the interference between 
time-reversed paths of winding number ±1. The m = 1 fundamental is easily suppressed, 
e.g. by placing several rings in series. 



2.5.1 Weak Localization 

A more rigorous discussion of enhanced backscattering was first discussed by Altshuler, 
Aronov, and Spivak (AAS) in 1981. AAS showed that there are corrections to Boltzmann 
transport of the form a = a + da, where a = ne 2 T/m* is the Drude conductivity and 
(including a factor of 2 for spin), 

5a = ■ 77 fd*rC(r,r) , (2.171) 



h V ^ 

where £ is the elastic mean free path and C(r, r') is the Cooperon propagator, which satisfies 
|-^(v + ^A(r)) 2 + ^|c(r,r') = 5{r - r') , (2.172) 

where is the inelastic collision time, = L^/D, where D = v F £ = £ 2 /t is the diffusion 
constant. The linear differential operator 

C = -£ 2 (v + ^A(r)) 2 (2.173) 
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bears a strong resemblance to the Hamiltonian of a particle of charge e* = 2e in an external 
magnetic field B = V x A. Expanding in eigenfunctions of L, we obtain the solution 



'c+—)c(r,r') = 6(r-r') 

V-»Y>*(r') 



C(r,r>) = J2 



(2.174) 
(2.175) 
(2.176) 



which resembles a Green's function from quantum mechanics, where the energy parameter 
is identified with — r/r^. There is accordingly a path integral representation for C(r,r'): 



C{r l ,r 2 ) = jdse—/r, j ' Vr[u] exp L j du \_L (JT) 2 _ | A(p) . 



<9r 
<9u 



(2.177) 



r(s)=r 2 



Notice that there is no potential term V{r) in the action of (2.177). The effect of the static 
random potential here is to provide a 'step length' I for the propagator. According to the 
AAS result (2.171), the corrections to the conductivity involve paths which begin and end 
at the same point r in space. The charge e* = 2e which appears in the Cooperon action 
arises from adding up contributions due to time-reversed paths, as we saw earlier. 

Let's try to compute 5a, which is called the weak localization correction to the conductivity. 
In the absence of an external field, the eigenvalues of C are simply k 2 £ 2 , where = 27rn M /L. 
We then have 

/rl d k 1 

A cutoff A is needed in dimensions d > 2 in order to render the integral convergent in the 
ultraviolet. This cutoff is the inverse step size for diffusion: A ~ l^ 1 . This gives, for r <C r^, 



6a *~h 



(d = 3) 



MtJt) (d = 2) 



.-^ (d=l)- 



(2.179) 



Let us now compute the magnetoconductance in d = 2. In the presence of a uniform 
magnetic field, C has evenly spaced eigenvalues 



An = (n +!)(-) , (2.180) 



where £ B = \fhc~Je~B is the magnetic length (for a charge q = —e electron). Each of these 
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Landau levels has a macroscopic degeneracy of 2eBA/hc = A/tt£ b , Thus, 

2 e 2 £ 2 1 
Sa(B) = -—-p 7 r 

n -B n=0 ("+2)4+^ 



2 ^|/4€ 2 

e 



n = 



where we have invoked t/t^ = i /L^. The magnetoconductance is then 

MS) - MO) = £ + - *(i + * ) } , (2.182) 



where 



is the digamma function. If the field is weak, so that £ B 3> £, then 

MB)-M0) = + i^(|) 2 , (2.184) 

which is positive, as previously discussed. The magnetic field suppresses phase coherence 
between time- reversed paths, and thereby promotes diffusion by suppressing the resonant 
backscattering contributions to 5a. At large values of the field, the behavior is logarithmic. 
Generally, we can write 

MB) - MO) = if /(f), (2.185) 



where 



and 



/(x) = lnx + *(i + M = i 24 asx^O 

JV 7 V2 ^ I In x- 1.96351... as x 00 . V ' 



2.6 Anderson Localization 

In 1958, P. W. Anderson proposed that static disorder could lead to localization of electronic 
eigenstates in a solid. Until this time, it was generally believed that disorder gave rise to 
an elastic scattering length i and a diffusion constant D = v F £. The diffusion constant is 
related to the electrical conductivity through the Einstein relation: a = \e 2 D(e F )Af(e F ) . If 
the states at the Fermi level are localized, then D(e F ) = 0. 



2.6. ANDERSON LOCALIZATION 81 



Anderson considered an electron propagating in a random potential: 

h 2 

H = V 2 + V(r), (2.188) 

2m 

where V(r) is chosen from an ensemble of random functions. Physically, V(r) is bounded 
and smooth, although often theorists often study uncorrelated 'white noise' potentials where 
the ensemble is described by the distribution functional 

P[V(r)] =exp{ - fdtrV 2 (r)} , (2.189) 

for which 

(V(r)V(r')) =7<5(r-r') 
A tight binding version of this model would resemble 

w = -* £ (4 s- + 4 c *) + £ £ * 4 ^ , (2.i9i) 

where the single site energies {e^} are independently distributed according to some function 
p(e). The first term can be diagonalized by Fourier transform: 

= -* £ (4 + 4 c *) = -** £ ^ 4 , (2.192) 
<«■> fc 

where 7 fe = z" 1 Y2s elk ' S i where S is a nearest neighbor direct lattice vector and z is the 
lattice coordination number; the bandwidth is 2zt. What happens when we add the random 
potential term to (2.192)? Suppose the width of the distribution p(e) is W, e.g. p(e) = 
W~ l ®(\W 2 — e 2 ), with W <C zt. We expect that the band edges shift from ±zt to 
±(zt+ \ W). The density of states must vanish for |e| > zt + ^W; the regions in the vicinity 
of ±(zt + \W) are known as Lifshitz tails. 

Aside from the formation of Lifshitz tails, the density of states doesn't change much. What 
does change is the character of the eigenfunctions. Suppose we can find a region of con- 
tiguous sites all of which have nW. Then we could form an approximate 
eigenstate by concentrating the wavefunction in this region of sites, setting its phase to 
be constant throughout. This is an example of a localized state. We can think of such a 
state as a particle in a box - the electron binds itself to local fluctuations in the potential. 
Outside this region, the wavefunction decays, typically exponentially. Scattering states are 
then extended states, and are associated with 'average' configurations of the {si}. The 
typical spatial extent of the localized states is given by the localization length £(e). The 
localization length diverges at the mobility edges as 

£(e)~|e-e c r'- (2-193) 

There is no signature of the mobility edge in the density of states itself - Af(s) is completely 
smooth through the mobility edge e c - 



(2.190) 
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£ 



Figure 2.9: Schematic picture of density of states in a disordered system showing mobility 
edges (dashed lines) and localized states in band tails. 



2.6.1 Characterization of Localized and Extended States 

One way of characterizing the localization properties of quantum mechanical eigenstates (of 
TL) is to compute the participation ratio Q. The inverse participation ratio for the state a 
is given by 

Qa 1 = E I^Wf / (E I^Wl 2 ) (discrete) (2.194) 

= Jd d x\i; a (x)^ (^jd d x\ijj a {x)\ 2 ^j (continuous) (2.195) 

Consider the discrete case. If ip a is localized on a single site, then we have £^ h/> a (z) = 1 
for all k, i.e. Q a = 1. But if tp a is spread evenly over N sites, then Q^ 1 = N/N 2 = N^ 1 , 
and Q a = N. Hence, Q a tells us approximately how many states | i ) participate in the 
state | ip a ). The dependence of Q a on the system size (linear dimension) L can be used as 
a diagnostic: 

localized => Q a ot L° (2.196) 
IV'a) extended ^ Q a ot L 13 , (2.197) 

where /3 > 0. 

Another way to distinguish extended from localized states is to examine their sensitivity to 
boundary conditions: 

i/j(x 1 , ... , x /i + L, ... , x d ) = e ie " ip(x 1 , ... , x^, ... , x d ) , (2.198) 
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where ji £ {1, . . . , d}. For periodic boundary conditions 6^ = 0, while antiperiodic boundary 
conditions have = ir. For plane wave states, this changes the allowed wavevectors, so 
that k^ — ► k^ + ir/L. Thus, for an extended state, 

S £cxt = £ pbc - e apbc ~ 5k oc ZT 1 . (2.199) 

For a localized state, 

fei oc oc e - L '^ . (2.200) 
One defines the dimensionless Thouless number 



Th(e, L) = |e pbc - e apbc | • N{e) . (2.201) 
In the vicinity of a mobility edge, a scaling hypothesis suggests 

Th{e,L) = f{L/Z{e)) , (2.202) 
where f{x) is a universal scaling function. 

As the Fermi level passes through the mobility edge into a region of localized states, the 
conductivity vanishes as 

<r(e F ) ~ |e F -e c | s , (2.203) 

where s > 0. Since the density is a continuous function of e p , this can also be turned into 
a statement about the behavior of a(n): 

cr(n) ~ (n - n c ) s @(n - n c ) . (2.204) 



2.6.2 Numerical Studies of the Localization Transition 

Pioneering work in numerical studies of the localization transition was performed by MacK- 
innon and Kramer in the early 1980's. They computed the localization length £ M (W/t, E) 
for systems of dimension M d ~ l x N, from the formula 

where G(£ l ) = (E 1 + it — H)^ 1 is the Green's function, and i,j label transverse sites. 
The average (• • • ) is over disorder configurations. It is computationally very convenient 
to compute the localization length in this manner, rather than from exact diagonalization, 
because the Green's function can be computed recursively. For details, see A. MacKinnon 
and B. Kramer, Phys. Rev. Lett. 47, 1546 (1981). Note also that it is (|G| 2 ) which is 
computed, rather than (G). The reason for this is that the Green's function itself carries 
a complex phase which when averaged over disorder configurations results in a decay of 
(G R Rl ) on the scale of the elastic mean free path I. 
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Figure 2.10: Mock-up of typical raw data from numerical study of localization length for 
d = 2 (left panel) and d = 3 (right panel) systems. For d = 2, Zoo(W/t) is finite and 
monotonically decreasing with increasing IV/t. For d = 3, there is a critical value of 
W/i. For W/t < (W/t) c , £m(W/*) diverges as M —> oo; this is the extended phase. For 
W/t > (W/t) c , %M(W/t) remains finite as M -t oo; this is the localized phase. 



MacKinnon and Kramer computed the localization length by employing the Ansatz of finite 
size scaling. They assumed that 

tM{T> E ) =Mf (^(T' E )/ M ) ' (2 ' 206) 

where f{x) is a universal scaling function which depends only on the dimension d. MK 
examined the band center, at E = 0; for d > 2 this is the last region to localize as W/t is 
increased from zero. A mock-up of typical raw data is shown in fig. 2.10. 

In the d = 2 case, all states are localized. Accordingly, t, M /M — > as M —>■ oo, and £ M (W/i) 
decreases with increasing W/t. In the d = 3 case, states at the band center are extended 
for weak disorder. As W/t increases, ^(W/t) decreases, but with £oc(W/t) still divergent. 
At the critical point, (W/t) c , this behavior changes. The band center states localize, and 
Zoo (W/t) is finite for W/t > (W/t) c . If one rescales and plots Z M /M versus Zoo/M, the 
scaling function f(x) is revealed. This is shown in fig. 2.11, which is from the paper by 
MacKinnon and Kramer. Note that there is only one branch to the scaling function for 
d = 2, but two branches for d = 3. MacKinnon and Kramer found (W/t) c ~ 16.5 for a 
disordered tight binding model on a simple cubic lattice. 
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Figure 2.11: Scaling function Xm/M versus Aqo/M for the localization length Xm of a 
system of thickness M for (a) d = 2, and (b) cZ = 3. Insets show the scaling parameter Aoo 
as a function of the disorder W. From A. MacKinnon and B. Kramer, Phys. Rev. Lett. 
47, 1546 (1981). 



d-2 



2.6.3 Scaling Theory of Localization 

In the metallic limit, the dimensionless conductance of a L d hypercube is given by the 
Ohmic result 

l, ~r 

(2.207) 
(2.208) 

(2.209) 



whereas in the localized limit we have, from Pichard's formula, 

g{L) = 4e- 2L '*. 
It is instructive to consider the function, 

ding 



dlnL 



which describes the change of g when we vary the size of the system. We now know the 
limiting values of /3(g) for small and large g: 



metallic (j> 1) 
localized (j<1) 



(3(g) =d-2 
2L 

(3(g) = — — = In g + const. 



(2.210) 
(2.211) 
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Figure 2.12: Sketch of the /3-function for the localization problem for d = 1,2,3. A critical 
point exists at g = g c for the d = 3 case. 



If we assume that (3(g) is a smooth monotonic function of g, we arrive at the picture in fig. 
2.12. Note that in d = 1, we can compute /3(g) exactly, using the Landauer formula, 

9 = , (2-212) 

where T oc exp(— L/£). From this, we obtain 

(3 d= i(g) = -(1 + g) Ml + g' 1 ) . (2.213) 



It should be stressed that the very existence of a (5- function is hardly clear. If it does exist, 
it says that the conductance of a system of size L is uniquely determined by its conductance 
at some other length scale, typically chosen to be microscopic, e.g. Lq = £. Integrating the 
/^-function, we obtain an integral equation to be solved implicitly for g(L): 

lng(L) 

L \ f ding 



ta <iJ- J m- (2 ' 214) 

Inff(Lo) 



A priori it seems more likely, though, that as L is increased the changes to the conductance 
may depend on more than g alone. E.g. the differential change dg might depend on the 
entire distribution function P(W) for the disorder. 
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Integrating the /3-function: d = 3 

We know (3(g — > 0) — In g and /3(<? — > oo) = 1, hence by the intermediate value theorem there 
is at least one point were (3(g) vanishes. Whenever g satisfies (3(g) = 0, the conductance g 
is scale invariant - it does not change with increasing (or decreasing) system size L. We 
will assume the situation is reflected by the sketch of fig. 2.12, and that there is one such 
point, g c . 

We now apply (2.214). Not knowing the precise form of (3(g), we approximate it piecewise: 

' 1 if 9 > 9+ 

(3(g) ~ I a \n(g/g c ) if <?_ < g < g + (2.215) 
Ing if g < g_ , 

where a = g c (3'(g c )- We determine g + and g- by continuity: 

lng + =lng c +- (2.216) 

a 

Ing- = hig c . (2.217) 

Now suppose we start with g = g c + Sg, where \5g\ <C 1. We integrate out to g = g + and 
then from g + to g S> 1: 



In 



In g+ 

M_ f ding = l_ ln Hg + /g c ) \ 



LoJ J a \n(g/g c ) « \ H%/9c) 

Ing 



lng 



In 

V L_ / 

In g+ 

which together imply 



(J^j = J dlng = ln(g/g + ) , (2.219) 



9(L)=A +9c .±.(g Q -g c )V° , (2.220) 
Lo 



where A + = (eu/g c ) l l a . The conductivity is 



_f!. l = f!.^. (9o _ 9c)1/ «. (2 . 221) 



h L h Lq 

If instead we start with g = g c — Sg and integrate out to large negative values of In g, then 

, 7 " dlng =-ln( H9c/9 -A (2.222) 
LoJ J aln(g/g c ) « \Hg c /g ) ) 
lng 



In 



Ing- 
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which says 

g (L) = e- 2L/f (2.224) 

e=^-(5 c -5 r 1/a , (2-225) 
A_ 

with 

A_ = ln(1/g - } - . (2.226) 

9c • []n(g c /g-)] 

On the metallic side of the transition, when g Q > g c , we can identify a localization length 
through 

9 = 9c/Z, (2-227) 

which says 

Z = ^(9o-9 c )- 1/a • (2-228) 

Finally, since g is determined by the value of the Fermi energy e F . we can define the critical 
energy, or mobility edge e c , through 

g(L ,e c )=g c , (2.229) 

in which case 

(-V V) • (2-230) 



og = g{Lo,e F ) -g c = 



de 

Thus, 5g oc de = (e F — e c ). 
Integrating the /3-function: d = 2 

In two dimensions, there is no fixed point. In the Ohmic limit g S> 1, we have 

0(g) = -- g + 0{g- 2 ) , (2.231) 

where c is a constant. Thus, 
and 

5 (L) = k F £ — c ln(L/£) , (2.233) 

where we have used the Drude result g = k F £, valid for Lq = I. We now see that the 
localization length £ is the value of L for which the correction term is on the same order 
as 9o : £ = ^exp(/c F £/c). A first principles treatment yields c = \- The metallic regime in 
d = 2 is often called the weak localization regime. 
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2 + e dimensions 



At or below d = 2 dimensions, there is no mobility edge and all eigenstates are localized. 
d = 2 is the lower critical dimension for the localization transition. Consider now the 
problem in d = 2 + e dimensions. One has 

(i(g) = e-- g +0(g- 2 ) . (2.234) 

The critical conductance lies at g c = c/e. For e — ► + , this is large enough that higher order 
terms in the expansion of the /3-function can safely be ignored in the metallic limit. An 
analysis similar to that for d = 3 now yields 

g>g c => g(L) = \aV (2.235) 

g<g c g(L) = e~ 2L ^ , (2.236) 

with 

e 2 

° = J L o-(9 - 9c) (2.237) 
2 T 

t=^-(9c-9 ) ■ (2.238) 

Note that a = g c P'(g c ) = +c/g c = e. We thus obtain 

i{e)<x\e-e c \- v (2.239) 

with v = 1 + 0(e). Close to the transition on the metallic side, the conductivity vanishes 
as 

a(e) oc\e - e c \ s . (2.240) 

The relation s = (d—2)v, which follows from the above treatment, may be used to relate the 
localization length and conductivity critical exponents. (In d = 3, MacKinnon and Kramer 
obtained v = s ~ 1.2.) 



2.6.4 Finite Temperature 

In the metallic regime, one obtains from the scaling theory. 

e 2 f2fc 2 ^ 2/1 ! 



^.2W=j [ k <- e -l^(j)) I 2 ' 242 ) 
2 

a d=1 (L) = ^--{4£-2(L-£)} . (2.243) 
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Clearly the d = 1 result must break down for even microscopic L > 3£. The above results 
are computed using the /3-function 

{3(g) = (d-2)-^ + 0(g- 2 ) , (2.244) 

where the coefficients c d are computed from perturbation theory in the disorder. 

At finite temperature, the cutoff becomes min(L, L^), where = yj Dt^ is the inelastic 

scattering length and D = v F £ is the diffusion constant. Suppose that t^(T) oc T~ p as 

T -> 0, so that L, = a(T/T )~ p / 2 , where T Q is some characteristic temperature (e.g. the 
Debye temperature, if the inelastic mechanism is electron-phonon scattering). Then, for 

L 4> >L > 

= °U ~ i ill - z(£Y /2 \ ( 2 - 245 ) 



2 e 2 I /a 



= °U ~ - x ln U 1 " * ln • - ' > (2 ' 246) 




^=i(r) = a d B =1 -2^| of - M , (2.247) 



where <r| is the Boltzmann conductivity. Note that a(T) decreases with decreasing tem- 
perature, unlike the classic low T result for metals, where p(T) = p + AT 2 . I.e. usually 
p(T) increases as T increases due to a concomitant decrease in transport scattering time r. 
Weak localization physics, though, has the opposite effect, as the enhanced backscattering 
is suppressed as T increases and decreases. The result is that p(T) starts to decrease as 
T is lowered from high temperatures, but turns around at low T and starts increasing again. 
This behavior was first observed in 1979 by Dolan and Osheroff, who studied thin metallic 
PdAu films, observing a logarithmic increase in p d=2 (T) at the lowest temperatures. 



Chapter 3 

Linear Response Theory 



3.1 Response and Resonance 



Consider a damped harmonic oscillator subjected to a time-dependent forcing: 

x + 2-ix + ulx = f{t) , (3.1) 

where 7 is the damping rate (7 > 0) and iv is the natural frequency in the absence of 
damping 1 . We adopt the following convention for the Fourier transform of a function H{t): 

00 



H(t)= l—H(cv)e-^ (3.2) 



—00 
00 



H{uj) = jdtH{t)e +iult . (3.3) 

—00 

Note that if H(t) is a real function, then H(—cj) = H*(u). In Fourier space, then, eqn. 
(3.1) becomes 

{<4 - 2ijco - uj 2 ) x{u) = f{u) , (3.4) 

with the solution 

m 

lo'q — 2z7w — uj 2 
where x(uj) is the susceptibility function: 

1 _ -1 

ul - 2i-/uj - LO 2 (u - LO + ){uj - LU_) 

with 

^ ± = -n± V / ^o 2 -7 2 • (3.7) 



*H= ». ol _,,2 ^M/H (3.5) 



*M = .,2_o^.,-,.,2 = ^ w : . ( 3 - 6 ) 



1 Note that f(t) has dimensions of acceleration. 
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The complete solution to (3.1) is then 

oo 

x(t) = / o+x h (t) 3.8 

— oo 

where x h (i) is the homogeneous solution, 

x h (t) = A+e-^+t + A_e _<w -* . (3.9) 

Since Im(a;_j_) < 0, x h (t) is a transient which decays in time. The coefficients A± may 
be chosen to satisfy initial conditions on x(0) and x(0), but the system 'loses its memory' 
of these initial conditions after a finite time, and in steady state all that is left is the 
inhomogeneous piece, which is completely determined by the forcing. 

In the time domain, we can write 

oo 

x(t)= jdt' x (t-t')f(t') (3.10) 

— oo 
oo 

X(s)= |^xHe-^, (3.11) 

— oo 

which brings us to a very important and sensible result: 

Claim : The response is causal, i.e. x(t — t') = when t < t', provided that x(co) is analytic 
in the upper half plane of the variable uj. 

Proof : Consider eqn. (3.11). Of x(w) ls analytic in the upper half plane, then closing in 
the UHP we obtain x(s < 0) = 0. 

For our example (3.6), we close in the LHP for s > and obtain 

X (s > 0) = (-2m) £ ResUiHe-) 

ie~ iuJ + s ie' iuJ - s 
= + , (3.12) 



UJ, — <jJ_ UJ_ — UJ i 



i.e. 



^gp= sin (^M^) e(*) if ujI > 7 2 

\(» - { ° (3-13) 

-jOL= sinh (v9^) ©(*) if ^ 



<7 2 



where 0(s) is the step function: 0(s > 0) = 1, G(s < 0) = 0. Causality simply means that 
events occuring after the time t cannot influence the state of the system at t. Note that, in 
general, x(t) describes the time-dependent response to a 5-function impulse at t = 0. 



3.2. KRAMERS-KRONIG RELATIONS 
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3.1.1 Energy Dissipation 

How much work is done by the force /(t)? Since the power applied is P(t) = f(t) x(t), we 
have 

oo oo 

P(t) = \% (-H m m *** J % /» e +M (3.14) 

— oo — oo 

oo oo 

AE= JdtP(t)= |^(_^)^)|/H| 2 . (3.15) 



— oo 



Separating x(uj) into real and imaginary parts, 

SM = x'M + tx"(w) , (3.16) 

we find for our example 

*V) = , 2 ^"2^ 2 2 = (3-17) 

(wq — lo z y + 47^ 

*"M = — 2 IT, /i 2 2 = _ X"( _W )- ( 3 - 18 ) 

The energy dissipated may now be written 

00 

AE= |^o,x>)|/>)| 2 ■ (3.19) 

—00 

The even function x'(u) is called the reactive part of the susceptibility; the odd function 
x"(w) is the dissipative part. When experimentalists measure a lineshape, they usually are 
referring to features in co x"(u), which describes the absorption rate as a function of driving 
frequency. 

3.2 Kramers-Kronig Relations 

Let x( z ) be a complex function of the complex variable z which is analytic in the upper 
half plane. Then the following integral must vanish, 

c 

whenever Im(£) < 0, where C is the contour depicted in fig. 3.1. 

Now let u) S R be real, and define the complex function x(^) of the real variable u by 

X(w) = lim x(^ + *<0 . (3.21) 

e-+0+ 
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1/ y 



Im(z) 



Re(z) 



Figure 3.1: The complex integration contour C. 



Assuming x( z ) vanishes sufficiently rapidly that Jordan's lemma may be invoked (i.e. that 
the integral of x( z ) along the arc of C vanishes), we have 

du xiy) 



w + ie 



-00 

00 



2tH 



V 



v — U) 



itt8(v — to) 



(3.22) 



where V stands for 'principal part'. Taking the real and imaginary parts of this equation 
reveals the Kramers- Kronig relations: 



J IT V- 



10 



du x'{v) 



x ( U ) = -V 

J IT U 



LU 



(3.23) 



(3.24) 



The Kramers-Kronig relations are valid for any function x( z ) which is analytic in the upper 
half plane. 



If x( z ) is analytic everywhere off the Im(z) = axis, we may write 

00 

J IT V — Z 



(3.25) 



This immediately yields the result 



lim [x(u> + ie) - x(u ~ ie)] = 2i x"M • 



(3.26) 
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As an example, consider the function 
Then, choosing 7 > 0, 



oj + 7 



2 • 



(3.27) 



x(*) 



C&J 1 



Tl UJ — Z UJ 2, + 7 2 



'i/(z + 47) if Im(z) > 



^—i/(z — 77) if Im(z) < . 



(3.28) 



Note that x(z) is separately analytic in the UHP and the LHP, but that there is a branch 
cut along the Re(z) axis, where x(w ± ie) = ±i/(u; ± 17). 

EXERCISE: Show that eqn. (3.26) is satisfied for xM = ^/(^ 2 + 7 2 )- 

If we analytically continue x( z ) from the UHP into the LHP, we find a pole and no branch 
cut: 

X(z) = . (3.29) 

z + l^f 



The pole lies in the LHP at z 



-27. 



3.3 Quantum Mechanical Response Functions 

Now consider a general quantum mechanical system with a Hamiltonian Ho subjected to a 
time-dependent perturbation, 7ii(t), where 



(3.30) 



Here, the {Qi} are a set of Hermitian operators, and the {4>i(t)} are fields or potentials. 
Some examples: 



' —M ■ B(t) 



Hi(t) = { 



magnetic moment - magnetic field 



f cfir g(r) 4>(r, t) density - scalar potential 



_ — ^ f (fir j{r) ■ A(r, t) electromagnetic current - vector potential 
We now ask, what is (Qi(t))? We assume that the lowest order response is linear, i.e. 



(Qi(t)) = dt' X ij(t - 1') <f>j{i!) + 0{<p k ft) . 



(3.31) 



Note that we assume that the 0((f> ) term vanishes, which can be assured with a judicious 
choice of the {Qi} 2 . We also assume that the responses are all causal, i.e. Xij(t — t') = for 



2 If not, define 



{Qi)o and consider (8Qi(t)). 
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t < t' . To compute Xij(t ~ we wm use ^ rs ^ order perturbation theory to obtain (Qi(t)) 
and then functionally differentiate with respect to (f>j(t'): 

S(Qi(t)) 

The first step is to establish the result, 

t 

|tf(t))=T expj-^y^' [W + Wi(O] }|*(<o)> . ( 3 - 33 ) 

to 

where T is the time ordering operator, which places earlier times to the right. This is easily 
derived starting with the Schrodinger equation, 

ih±\nt)) = H(t)\V(t)) , (3-34) 
where H(t) =Hq + Hi(t). Integrating this equation from t to t + dt gives 

\*(t + dt))=(l- ^H{t) dtj | (t) > (3.35) 

| *(to + N dt) > = (l - l -H{t Q + (N — l)dt)j ---(l- %H(to)j | *(t ) > , (3.36) 

hence 

\V(t 2 )) = U(t 2 ,t 1 )\V(t 1 )) (3.37) 

£/(t 2 ,ti) = Texp| - ^JdtH(t)^ . (3.38) 

*i 

U(t 2 ,t\) is a unitary operator (i.e. ?/t = U~ r ), known as the time evolution operator 
between times t\ and t 2 . 

EXERCISE: Show that, for t\ < t 2 < t 3 that U(t 3 ,ti) = U(t 3 ,t 2 ) U(t 2 ,h). 
Iiti<t< t 2 , then differentiating U(t 2 ,t\) with respect to <pi(t) yields 

m^='-U( h , t )Q j U(t,t 1 ), (3.39) 

since dH.(t)/d<f)j(t) = —Qj- We may therefore write (assuming to < t,t') 



{</>;=0} ^ 



^ e -fHo(t-e)/h Q . e -iWo(f-to)/» I ^( to ) _ t ') 

* e -iWot/ft q^') e +iWo to/a | ^( tQ ) )Q( t _ ? (3.40) 
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where 



Qj(t) 



— p iHot/h 



Qj e 



(3.41) 



is the operator Qj in the time-dependent interaction representation. Finally, we have 



Xij(t-t') 



{ - % -{y{t () )\e- moto l h Q j {t') e +inot / h Qi\y(t)) 
+ % -{ *(i) | Qi e-'^^Qjit') e +m ° t0 ' h | *(t ) )}©(*- 

l([Qi(t),Qj(t')])e(t-t'), 



(3.42) 



were averages are with respect to the wavefunction | <I> ) = exp(— i7Yo W^)| *(*o) ), with 
to — ► — oo, or, at finite temperature, with respect to a Boltzmann-weighted distribution of 
such states. To reiterate, 



Mt-t') = -([Q l (t),Q J (t')])e(t-t') 



(3.43) 



This is sometimes known as the retarded response function. 



3.3.1 Spectral Representation 

We now derive an expression for the response functions in terms of the spectral properties 
of the Hamiltonian Hq. We stress that Hq may describe a fully interacting system. Write 
Ho | n ) = hio n | n ) , in which case 

oo 

XiM = \ j dt e iu)t ([Qi(t),Qj(0)]) 
o 

oo 

= % - jdt e iu)t ^ ^2 e ' (3nU}m {( m \Qi\ n )( n \Qj\ m ) e+i( ^ UJn)t 

- ( m | Qj | n > < n \ Qi \ m ) e+^-^t j ? ( 344 ) 

where /3 = l/k B T and Z is the partition function. Regularizing the integrals at t — > oo with 
exp(— et) with e = + , we use 

oo 

j dtei(f0 ~ a+ie)t = u _Q + ie ( 3 - 45 ) 
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to obtain the spectral representation of the (retarded) response function 3 , 



m,n 




Qj 


n ) ( n 


Qi m) (m 




n ) ( n 




m ) 


1 


uj - uj m + uj n + ie uj + uj m - uj n + ie 



(3.46) 



We will refer to this as Xii( w ); formally Xij( w ) nas poles or a branch cut (for continuous 
spectra) along the Re(cj) axis. Diagrammatic perturbation theory does not give us Xij( u )i 
but rather the time- ordered response function, 

x l(t-t')= l -(TQi{t)Qj{t')) 

= \ (Qi(t) Qj{t')) e(t - 1') + % - (Qtf) Qi(t)) e(t' - 1) . (3.47) 



The spectral representation of xfji^) 1S 



m,n 


( m 




n)(n 




m ) ( m 


ft 


n><n 




m)) 


\ 


uj - uj m + uj n - ie uj + uj m - uo n + ie 



" ~~ ~ " (3.48) 
The difference between Xiji^) an d xjji 1 ^) 1S thus only in the sign of the infinitesimal He 
term in one of the denominators. 



Let us now define the real and imaginary parts of the product of expectations values en- 
countered above: 

(m\Qi\n)(n\Qj\m) = A mn (ij) + iB mn (ij) . (3.49) 

That is 4 , 

A mn {ij) = 2 ( m | Qi | n ) ( n | Qj | m ) + 2 ( 171 1 I n ) ( n I Qi I m } (3.50) 
B mn {ij) = ^(m\Qi\n) (n\Qj\m) - — ( m \ Qj \ n ) ( n \ Qi \ m ). (3.51) 

Note that A mn (ij) is separately symmetric under interchange of either m and n, or of i and 
j, whereas B mn (ij) is separately antisymmetric under these operations: 

A mn (ij) = -\- A nm (ij) = A nm (ji) = -\-A mn (ji) (3.52) 
B mn {ij) = -B nm (ij) = B nm (ji) = -B mn (ji) . (3.53) 

We define the spectral densities 

3 The spectral representation is sometimes known as the Lehmann representation. 
4 We assume all the Q i are Hermitian, i.e. Q i = Q\. 
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which satisfy 



In terms of these spectral densities, 



4M = +e-^4( W ) (3.55) 
gfj-u) = -e~^ gfJu) . (3.56) 



oo 

Xi,-H = V Jdv - tt(1 - e~^) g^u) = (3.57) 

— oo 
oo 

*J» = 7> |<fc/ g%{v) + tt(1 - e-^) = (3.58) 



For the time ordered response functions, we find 



2i> 



xi/H = p/fc ^-(1/) - tt(i + e -^) ggH 



I/'' — 



-oo 
oo 



2cj 



(3.59) 
(3.60) 



Hence, knowledge of either the retarded or the time-ordered response functions is sufficient 
to determine the full behavior of the other: 



[xij-M + xJ-iM] 
[xijH - xj-i(w)] 
[x>) + x>)] 
[x»-x»] 



xtanh(i/?M 
[X-7M + XFH] xtanh(i/?M 



(3.61) 
(3.62) 
(3.63) 
(3.64) 



3.3.2 Energy Dissipation 

The work done on the system must be positive! he rate at which work is done by the 
external fields is the power dissipated, 



d_ 
dt 



(V(t)\H(t)\V(t)) 



¥(t) 



ffli(t) 
at 



*W) = -E(ft(tW(i) , 



(3.65) 
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where we have invoked the Feynman-Hellman theorem. The total energy dissipated is thus 
a functional of the external fields {(f>i(t)}: 



oo oo 



W= dt P(t) = - dt dt' Xij (t - 1') fat) fait' 



— oo — oo 
oo 



duj 
2^ 



(3.66) 



Since the {Qi} are Hermitian observables, the {4>i(t)} must be real fields, in which case 
(j)*(uj) = (p-(-uo), whence 



duj 



4tt 

oo 
oo 



where 



Mij(u) = \(-iu)[xij{u) - Xji(-u)] 
Note that as a matrix M(uS) = M^(u), so that M(u>) has real eigenvalues. 



(3.67) 



(3.68) 



3.3.3 Correlation Functions 



We define the correlation function 

S ij {t) = {Q i {t)Q j {t')) , 
which has the spectral representation 

= — ^2 e^ l3hulm ( m | Qi | n ) { n | Qj | n ) 5{uj — u n + u n 



Note that 
and that 



XvM-3Cji(-w) = ^(l-e-^)4-(, 



(3.69) 



(3.70) 



(3.71) 



(3.72) 



This result is known as the fluctuation-dissipation theorem, as it relates the equilibrium 
fluctuations Sij(uj) to the dissipative quantity Xij{w) ~ Xji(~ u )- 
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Time Reversal Symmetry 

If the operators Qi have a definite symmetry under time reversal, say 

TQiT~ l = rnQi , (3.73) 
then the correlation function satisfies 

Sij(u) = rji rjj Sji(u) . (3.74) 

3.3.4 Continuous Systems 

The indices % and j could contain spatial information as well. Typically we will separate 
out spatial degrees of freedom, and write 

S ij (r-r',t-t') = (Q i (r,t)Q j (r',t')) , (3.75) 

where we have assumed space and time translation invariance. The Fourier transform is 
defined as 

oo 

/3„ / J4- „-ik-r 



S(k,u) = Jd d rjdt e- l "- r S{r,t) (3.76) 

— oo 
oo 

^ Jdte +iu)t (Q{k,t)Q{-k^)) . (3.77) 

-oo 



3.4 Example: S = \ Object in a Magnetic Field 

Consider a S = \ object in an external field, described by the Hamiltonian 

H = 7#oS 2 (3.78) 

with Bq > 0. (Without loss of generality, we can take the DC external field Bq to lie along 
z.) The eigenstates are | ± ), with oj± = ±^jBq. We apply a perturbation, 

H l {t) = 1 S-B l {t) . (3.79) 

At T = 0, the susceptibility tensor is 

( - \S^n)(n\S a \ - > < - \S a \n)(n\Sf>\ - ) 



^ (t-\SP\n)(n\S a \-) (- S a \n)(n\SP - )) 

Xa/Hw = t). : — ~ —— t 

h ^ y uj — c<j_ + to n + le <jj + cj_ — <jj n + le J 

_ 7 2 ( ( ~ | g | + )( + | S a | - ) (-\S a \ + )( + \S?\-) 
h \ u> + 7-Bq + ie uj — jBq + it 



(3.80) 



102 CHAPTER 3. LINEAR RESPONSE THEORY 



where we have dropped the hat on x a p{u)) for notational convenience. The only nonzero 
matrix elements are 

hi 2 

X+-M= , I , ■ (3.81) 



-h-y 2 

X-+M = 7T- - , (3-82) 



or, equivalently, 

Xxx{u) = \hrf \ ■ X — — 1 , 1 = +Xyy{u) (3.83) 

[to + 7-Do + *e ^ — 7-do + «e J 
Xxy(^) = \hl 2 { ~ \ ~ . + ^— -) = -x^M. (3.84) 

3.4.1 Bloch Equations 

The torque exerted on a magnetic moment [i by a magnetic field H is N = fx x H, which 
is equal to the rate of change of the total angular momentum: J = N. Since fx = 7 J, 
where 7 is the gyromagnetic factor, we have fx = 7/i x i3". For noninteracting spins, the 
total magnetic moment, M = fXi then satisfies 

^ =jM x H . (3.85) 

Now suppose that H = Hq z+H±(t), where z-H± = 0. In equilibrium, we have M = Mq z, 
with Mq = X0H0, where %o is the static susceptibility. Phenomenologically, we assume 
that the relaxation to this equilibrium state is described by a longitudinal and transverse 
relaxation time, respectively known as T\ and T^: 

M x = jM y H z - -fMzHy - ^ (3.86) 

-£2 

M y = ~fM z H x - 7 M X H Z - ^ (3.87) 

^2 

M z = jM x H y - 7 M y H x - Mz ~ M ° . (3.88) 

J-i 

These are known as the Bloch equations. Mathematically, they are a set of coupled linear, 
first order, time-dependent, inhomogeneous equations. These may be recast in the form 

M a + RapM 13 = , (3.89) 

I38 r 

(T 2 \ 

(3.90) 



with R af3 (t) = T-l - 1 e a0S H 5 (t), r = Tj Mg, and 
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The formal solution is written 

t 



M(t) = jdt' U(t - t') </>(0 + U(t) r/>(0) , (3.91) 



o 

where the evolution matrix, 

U(t) = T exp { - Jdt'R(t') } , (3.92) 

is given in terms of the time-ordered exponential (earlier times to the right). 

We can make analytical progress if we write write M = Mo z + m and suppose | H± | <C Hq 
and |m| <C Mq, in which case we have 

m x = 7 H m y - 7 H y M - — ^ (3.93) 

J-2 

Tin 

m y = 1 H x M Q - 1 H Q m x - -» (3.94) 

J-2 

m z = , (3.95) 

which are equivalent to the following: 

m x + 2T 2 l m x + ( 7 2 # 2 + T 2 2 )m x = 7 M (7 H H X - T 2 x H y - H y ) (3.96) 

m y + 2T 2 l m y + ( 7 2 # 2 + T 2 2 )m y = 7 M (7 #0^ + T 2 l H x + fix) (3.97) 

and m z (t) = m z (0) exp(— t/T\). Solving the first two by Fourier transform, 

( 7 2 ^ + T 2 2 - u? - 2iT 2 2 uj) rh x (u) = 7 M (>y H H x [u) + (iw - T 2 l )H y {u:)) (3.98) 
( 7 2 tf 2 + T 2 - 2 - u; 2 - 2iT 2 - 2 w) m^H = ~f M ( 7 H H y (u) - {iuj - T^)H x {u)) , (3.99) 

from which we read off 

XM = ^HI + t/-^-2 iT2 ^ = (3 - 100) 

X*y(«) = ( jh, -^^ M ° - = - Xvx{u) . (3.101) 

yV 7 - f 2 H 2 + T 2 2 -L0 2 -2iT 2 1 uj Xy y ' V ' 

Note that Onsager reciprocity is satisfied: 

Xxy(v, H ) = y^ yx {uj, H ) = X yx {u, -Hq) = ~Xyx{u, H ) . (3.102) 

The lineshape is given by 

( 7 2 H 2 + T 2 2 -co 2 ) 7 2 H M 
( 7 2 J ff 2 + T 2 - 2 -^) 2 +4 r 2 - 2 ^ 

" ( \ 2 7 H M T 2 l m 
XM = (, 2 H 2 + T 2 2 -^ + 4T 2 2 ^ ' (3 - 104) 



XxxM = ; „ ^2 .„_■> n (3.W3) 



so a measure of the linewidth is a measure of T 2 1 . 
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3.5 Electromagnetic Response 



Consider an interacting system consisting of electrons of charge — e in the presence of a time- 
varying electromagnetic field. The electromagnetic field is given in terms of the 4-potential 
A^ = {A , A): 

£ = _VA°-^ (3.105) 



at 



B = V xA . 

The Hamiltonian for an iV-particle system is 
N 



(3.106) 



f 1 2 1 

i=l ^ ' i<j 

= H(Q)-- c Jd 3 x jP(aj) A"(aj,t) + ^2 Jd 3 x n(x) A 2 (x,t) , 



where we have defined 



iV 



n(x) = ^ <5(x — Xi) 



i=i 



A? 



j p (x) = ~2^5Z {Pi'H® ~ X ») + 5 ( X ~ ^Pi} 

i=l 



(3.107) 

(3.108) 

(3.109) 
(3.110) 



j$(x) = cen(x) . 

Throughout this discussion we invoke covariant/contravariant notation, using the metric 

/-l 0\ 



g^u = 9 



,fJ,U 



10 
10 

V 001/ 



so that 



jfJ, = 9/J.uj = ( ~j ) j J j 1 j ) 

3^ = f 9lw A- = -j°A° +j-A^j-A 



(3.111) 

(3.112) 
(3.113) 
(3.114) 



The quantity jft(x) is known as the paramagnetic current density. The physical current 
density j^x) also contains a diamagnetic contribution: 

M*) = " c = Hi*) + (3-115) 



6A»(x) 



j\x) = -^-n(x)A(x) = -JLfi{ x )A(x) 



3o(x) = • 



(3.116) 
(3.117) 
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The electromagnetic response tensor is denned via 

(j^(x,t)) = —^Jc^JdtK lll/ (xt;x'lf)A v (x',lf) , (3.118) 
valid to first order in the external 4-potential A^. From 

(j*(x,t)) = -^(jV(x,t))A»(x,t)(l-5 ll0 ) , (3.120) 



mc 

we conclude 



K^(xt;x't') = ^([jV( x ,t),f u (x',t')])e(t-t') (3.121) 

47TC 

+ ^2 O'O (*> *)> " " V (1 " <W • 

The first term is sometimes known as the paramagnetic response kernel, Kjl u (x;x') = 
(47ri / ihc 2 ) ( [j%(x), jv(x')] ) @(t—t') is not directly calculable by perturbation theory. Rather, 
one obtains the time-ordered response function K^v{x;x') = (47r/ihc 2 )(Tjfi(x) jv(x')), 
where x^ = (ct, x). 



Second Quantized Notation 

In the presence of an electromagnetic field described by the 4-potential A^ = (ccf),A), the 
Hamiltonian of an interacting electron system takes the form 

H = ^Tld 3 x ^t (aj) |_L^ v + ~ C A ) 2 - ^V) + U(x)\ ip a (x) 

+ \H jd\fd\' ^l{x)^l,{x')v{x - x')^ al {x')^ a {x) , (3.122) 

ct.ct' 

where v(x — x') is a two-body interaction, e.g. e 2 /\x — x'\, and U(x) is the external scalar 
potential. Expanding in powers of A^, 

H (A») = H(0)-- c Jd 3 xjV(x)A»(x) + ^Y^ j<?x^ a {x)^ a {x) A\x) , (3.123) 

where the paramagnetic current density jfi(x) is defined by 

f (x) = ceY,^)^) (3-124) 

CT 

^ = ^ E - (VV>ct(*)) V'ctN} • (3-125) 

CT ^ ' 
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3.5.1 Gauge Invariance and Charge Conservation 

In Fourier space, with q^ = (u>/c,q), we have, for homogeneous systems, 

(U9)) = -^K lw (q)A''(q) . (3.126) 
Note our convention on Fourier transforms: 

H{x) = j-^L H {k)e +ik * (3.127) 

H(k) = Jd 4 xH(x)e- ik x , (3.128) 

where k ■ x = = k ■ x — uit. Under a gauge transformation, A^ — ► A^ + d^A, i.e. 

A»{q) -► A»(q) + iA{q) q» , (3.129) 

where A is an arbitrary scalar function. Since the physical current must be unchanged 
by a gauge transformation, we conclude that K fiu (q) q v = 0. We also have the continuity 
equation, d^j^ = 0, the Fourier space version of which says q^ j^(q) = 0, which in turn 
requires q^ K lxu (q) = 0. Therefore, 



Y J Q ll K fll/ (q) = Y J K,u(q) <f = 



(3.130) 



In fact, the above conditions are identical owing to the reciprocity relations, 



ReK^(q) = +ReK^(-q) (3.131) 
ImV(«) = -TmK vlt (-q) , (3.132) 



which follow from the spectral representation of K^ v {q). Thus, 



gauge invariance •<=>■ charge conservation (3.133) 



3.5.2 A Sum Rule 

If we work in a gauge where ^4° = 0, then E = —c~ 1 A, hence E{q) = iq°A(q), and 

(ji{q)) = -^ K K ij {q)A\q) 

= a tJ (q) EP{q) . (3.134) 
Thus, the conductivity tensor is given by 

%(? 

<rij(q,u) = — -fQj(q,u;) . (3.135) 
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If, in the uj — > limit, the conductivity is to remain finite, then we must have 

oo 

J^Jdt{[jf(x,t)jj(0,0)])e^= % ^8 ij , (3.136) 
o 

where n is the electron number density. This relation is spontaneously violated in a super- 
conductor, where a(oj) oc uj^ 1 as uj — > 0. 



3.5.3 Longitudinal and Transverse Response 

In an isotropic system, the spatial components of K^ u may be resolved into longitudinal 
and transverse components, since the only preferred spatial vector is q itself. Thus, we may 
write 

Kij(q, uj) = K|| (q, uj) & qj + K ± (q, uj) (Sij - q\ qj) , (3.137) 

where q\ = qi/\q\. We now invoke current conservation, which says c/ M K fll/ (q) = 0. When 
v = j is a spatial index, 

q° K 0j + q i Kij = - K 0j + K„ qj , (3. 138) 



UJ 

c 

which yields 

(3.139) 



c 

K 0j (q,uj) = q> KJq,uj) = K j0 {q,uj) 

uj 1 



In other words, the three components of Koj(q) are in fact completely determined by K^{q) 
and q itself. When v = 0, 



= q° K 00 + q l K i0 = - K m - - q 2 K,, , (3.140) 

c uj II 



which says 



K 00 {q,uj) = K\\(q,w) 



(3.141) 



Thus, of the 10 freedoms of the symmetric 4x4 tensor K fll/ (q), there are only two indepen- 
dent ones - the functions K^(q) and K ± (q). 

3.5.4 Neutral Systems 

In neutral systems, we define the number density and number current density as 

N 



n(x) = ^2,5{x - Xi) (3.142) 

i=i 

1 N 

j(x) = — ^ {Pi S ( X ~ X i) + S ( X ~ X i)Pi] ■ (3.143) 
i=l 
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The charge and current susceptibilities are then given by 

X (x,t) = l -([n(x,t),n(0,0)]) (3.144) 

Xij (x,t)= l -{[j i (x,t),j j (0,0)]) . (3.145) 

We define the longitudinal and transverse susceptibilities for homogeneous systems accord- 
ing to 

Xij(Q,^)=X\\(Q,^)QiQj + X±(Q,^)(^ij-QiQj) • (3.146) 
From the continuity equation, 



follows the relation 



V-j + ^=0 (3.147) 

9 

ft to 

xn(<z,w) = - + -2x(9,f). ( 3 - 148 ) 

II m 



EXERCISE: Derive eqn. (3.148). 

The relation between K^ ll/ {q) and the neutral susceptibilities defined above is then 

K 00 (x, t) = -4vre 2 x (x, t) (3.149) 

K ij (x,t) = 4 ^[^6(x)S(t)- Xij (x,t)} , (3.150) 

and therefore 

47re 2 r n , . i , 

^||(9^) = ^{--X||(Q^)} (3-151) 

^ ± (q,u;) = 4 ^ 2 {--x ± (q^)} ■ (3.152) 
3.5.5 The Meissner Effect and Superfluid Density 

Suppose we apply an electromagnetic field E. We adopt a gauge in which A = 0, E = 
—c~ 1 A, and B = V x A. To satisfy Maxwell's equations, we have q ■ A(q,u>) = 0, i.e. 
A(q,uj) is purely transverse. But then 

(j(q,Lo)) = -^-K ± (q,Lo)A(q,io) . (3.153) 

This leads directly to the Meissner effect whenever lim^o K ± (q, 0) is finite. To see this, 
we write 

V x B = V (V ■ A) - V 2 A 

4vr . IdE 

c c at 

= *(-h) K ^-™' iai > A -?W' (3 - 154) 
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which yields 

( v2 " = K ± (-iV,id t )A . (3.155) 

In the static limit, V 2 A = K ± (iV, 0) A, and we define 

^ = limK ± (q,0) . (3.156) 
A L is the London penetration depth, which is related to the superfluid density n s by 

2 

mc 

Us = ~A 2V2 (3.157) 

= n — mlim Xi(q,0) . (3.158) 

Ideal Bose Gas 

We start from 

Xij(Q,t) = j^{[ji(q,t),jj(-q,0)]) (3.159) 

i*W = £n E( 2fc * + «) 4 ^ k+q ■ (3-160) 

k 



For the free Bose gas, with dispersion uo k = hk 2 /2 



m. 



ji(q, t) = (2k t + ft) e'H-'W ^ (3161) 
fc,fc' 

x MlW^^-J (3-162) 

Using 

[AS, CD] = A [5, C] D + AC [B, £>] + C [A, + [A, C] D.B , (3.163) 

we obtain 

[Uq,t),M-q,0)] = ^^(2^+ft)(2A:,+^)e^-^{n K)-n K +g )} , (3.164) 
where n°(u;) is the equilibrium Bose distribution 5 , 

n » = e ^X-l ■ (3 - 165) 



^Recall that fi — in the condensed phase. 
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Thus, 



Xij(q, (J) = > {2ki + qi){2kj + - (3.166) 



hn 
4m 2 



lo + uj q + «e — o; g + ie . 

^/^ ? t+3/2) ""° K T > fet - (3 - 167) 

m 7 {2tt) u + UJk _ q/2 -u k+q/2 + ie 
where no = Nq/V is the condensate number density. Taking the uj = 0, q — > limit yields 



Xlj (q 0, 0) = ft & + - <% , (3.168) 
m m 

where n' is the density of uncondensed bosons. From this we read off 

X,, (q - 0, 0) = - , x (q ^ 0, 0) = - , (3.169) 

H m -L m 

where n = n + n' is the total boson number density. The superfluid density, according to 
(3.158), is n s = n (T). 

In fact, the ideal Bose gas is not a superfluid. Its excitation spectrum is too 'soft' - any 
superfiow is unstable toward decay into single particle excitations. 



3.6 Density-Density Correlations 



In many systems, external probes couple to the number density n(r) = Y^i=i — r i)i an< ^ 
we may write the perturbing Hamiltonian as 

Hi(t) = - Jd\n{r)U(r,t) . (3.170) 
The response 5n = n — (n) is given by 

{Sn(r,t)) = Jd 3 r'Jdt'x(r-r',t-t')U(r',t') (3.171) 

{6h(q,u)) = X (q,u)U(q,v) , (3-172) 
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where 





t\3 




( m | n g | n ) 







= i [dvS{q,v){—± 
n J {lo + u 



S(q,u) 



uj - uj m + uj n + ie uj + uj m - uj n + ie 
1 



2vr 



E 1 



+ ie uj — u + ie 

2 



m | n g | n ) 5(u -oj n + uj m ) . 



(3.173) 
(3.174) 



Note that 



A' 



-iq-Ti 



(3.175) 



i=l 



and that h\ = n_ q . S(q,uj) is known as the dynamic structure factor. In a scattering ex- 
periment, where an incident probe (e.g. a neutron) interacts with the system via a potential 
U(r — R), where R is the probe particle position, Fermi's Golden Rule says that the rate at 
which the incident particle deposits momentum hq and energy huj into the system is given 
by 



2vr 

l(q,oj) = -> c 
m.n 



— fifkOn 



(m;p\Hi\n;p - hq) 5{uj - uj n + uj m ) 



\U(q)\ 2 S(q,uj) . 



(3.176) 



The quantity | U (q) | is called the form factor. In neutron scattering, the "on-shell" con- 
dition requires that the incident energy e and momentum p are related via the ballistic 
dispersion e = p 2 /2m n . Similarly, the final energy and momentum are related, hence 



2m n 2m n 



2„2 



hq ■ p h q 



2m n 



(3.177) 



Hence, for fixed momentum transfer hq, uj can be varied by changing the incident momentum 
P- 

Another case of interest is the response of a system to a foreign object moving with trajectory 
R(t) = Vt. In this case, U(r,t) = U(r- R(t)), and 



so that 



U(q,uj) = jd\jdte- iq r e iwt U(r-Vt) 
= 2it5{uj -q-V)U(q) 

(5n(q,uj)) = 2tt5(uj -q- V)x(q,u) ■ 



(3.178) 
(3.179) 
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3.6.1 Sum Rules 



From eqn. (3.174) we find 



oo 



m\n„ \n 



) {u) n - LO m ) 



= ^zE e_/3? " m ( m l^l n >< n l[^4]l m > 



nvz 
1 



1 



(3.180) 



= — <n,[W,nJ] > = —([«„[«,»»;]]>, 

where the last equality is guaranteed by q — ► — q symmetry. Now if the potential is velocity 
independent, i.e. if 



2m 



i=i 



then with h\ = YliLi e%qri we obtain 



N 



2 iV 



2m 
i=i 

^•E(^*"+^.) 

i=i 

AT N 



(3.181) 



(3.182) 



i=i j=i 



= Nh 2 q 2 /m . 
We have derived the f-sum rule: 



oo 

j d ^uS{q^) 



Nhq 2 
2mV 



(3.183) 



(3.184) 



Note that this integral, which is the first moment of the structure factor, is independent of 
the potential. 

n times 

A 

1 /. 



hv 



(3.185) 



Moments with n > 1 in general do depend on the potential. The n = moment gives 

oo 



* JSr (n(r) n(0)) . 



-iq-r 



(3.186) 



which is the Fourier transform of the density-density correlation function. 
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Compressibility Sum Rule 



The isothermal compressibility is given by 



Kri 



1 dV 

V~dn 



1 dn 
T n 2 d\x 



(3.187) 



Since a constant potential U(r,t) is equivalent to a chemical potential shift, we have 

oo 

du S(q,u) 



1 f du. 

(5n) = x (0, 0)Sn => k T = J~2 J™ J ^ 

— oo 

This is known as the compressibility sum rule. 



(3.188) 



3.7 Dynamic Structure Factor for the Electron Gas 

The dynamic structure factor S(q,oo) tells us about the spectrum of density fluctuations. 
The density operator fi\ = Y2i e tq ' ri increases the wavevector by q. At T = 0, in order for 
( n | h\ | G ) to be nonzero (where | G ) is the ground state, i.e. the filled Fermi sphere), the 
state n must correspond to a particle-hole excitation. For a given q, the maximum excitation 
frequency is obtained by taking an electron just inside the Fermi sphere, with wavevector 
k = k F q and transferring it to a state outside the Fermi sphere with wavevector k + q. For 
\q\ < 2k F , the minimum excitation frequency is zero - one can always form particle-hole 
excitations with states adjacent to the Fermi sphere. For \q\ > 2k F , the minimum excitation 
frequency is obtained by taking an electron just inside the Fermi sphere with wavevector 
k = — k F q to an unfilled state outside the Fermi sphere with wavevector k + q. These cases 
are depicted graphically in fig. 3.2. 



We therefore have 



^max(^) 
^min(^) 



hq 2 hk F q 
2m m 



Hq 2 _ Hk F q 



k 2m 



if q < 2k F 
iiq> 2k F . 



(3.189) 



(3.190) 



This is depicted in fig. 3.3. Outside of the region bounded by w min (q) and uj max (q), there 
are no single pair excitations. It is of course easy to create multiple pair excitations with 
arbitrary energy and momentum, as depicted in fig. 3.4. However, these multipair states 
do not couple to the ground state | G ) through a single application of the density operator 
h\, hence they have zero oscillator strength: ( n | h\ | G ) =0 for any multipair state | n ). 
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( & ) "meJq) ( b ) "mm(q <2k F ) 




(c) co mm (q>2k F ) 

Figure 3.2: Minimum and maximum frequency particle-hole excitations in the free electron 
gas at T = 0. (a) To construct a maximum frequency excitation for a given q, create a hole 
just inside the Fermi sphere at k = k F q and an electron at k' = k + q. (b) For |q| < 2k F the 
minumum excitation frequency is zero, (c) For |q| > 2k F , the minimum excitation frequency 
is obtained by placing a hole at k = — k F q and an electron at kl = k + q. 

3.7.1 Explicit T = Calculation 

We start with 

S(r,t) = (n(r,t)n(0,0) (3.191) 

= /^/(^ elfe - r E<^ (t)e "'^> • ( 3 - 192 ) 

The time evolution of the operator Vi(t) is given by ri(t) = r, +pit/m, where Pi 
Using the result 

e A+B = e A e B e -\[A,B] 
which is valid when [A, [A, B]] = [B, [A, B]] = 0, we have 

g— ik-ri(t) ^ihk 2 t/2m ^—ik-r g—ik-pit/m 

hence 

S(r, t)= I f ^ e ^^ 2 */2 m e ifcT J- ^-ik-r, jk- Pi t/m jk'-rj } q lg5) 



(3.193) 
(3.194) 
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Figure 3.3: Minimum and maximum excitation frequency u in units of e F /h versus wavevec- 
tor q in units of k F . Outside the hatched areas, there are no single pair excitations. 

We now break the sum up into diagonal (i = j) and off-diagonal (i / j) terms. 
For the diagonal terms, with i = j, we have 

l^-ik-Ti e ik-pit/m gifc'-r^ _ ^-ihk-k't/m ^ e i(k-k)-ri ^k-pit/m^ ^ 19Q) 



since the ground state | G ) is a Slater determinant formed of single particle wavefunctions 
i>k{r) = exp(iq • r)/W with q < k F . 



For i ^ j, we must include exchange effects. We then have 



-ik-ri ik-pit/m ik'-r. 



Q q' 



3_ —iHk-qt/m J —ik-ri p ik'r 



e i(q-q' -k)-ri & i{q' '-q+k')-rj 



q q 



e- ink qt ' m \5{k) S(k') - 5{k - k') 5{k + q' - q)} . (3.197) 
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low frequency 
large wavevector 




high frequency 
small wavevector 



Figure 3.4: With multiple pair excitations, every part of (q, to) space is accessible. However, 
these states to not couple to the ground state |G) through a single application of the density 
operator h q . 



Summing over the i = j terms gives 

^ ^ Jk-r „-ihk 2 t/2m f d 3 q 



Wr, t) = e ikr e~ imH ^ J ^ ®{K ~ q) e-^ m , (3.198) 



while the off-diagonal terms yield 

d3k ik-r f d \ f 



x (2vr) 3 |<5(fc) - e +i^t/2m e ^ihk- q t/ m ^ q _ q , _ k A 



(3.199) 

and hence 



S(k,u) = n 2 {2^) A 5{k)5{u) + j 0(fc F - q) ^vr s(u 



hk 2 hk ■ 

2m m / 



-@(k F -\k-q\)27rSL+ r ^- nJ ^)\ (3.200) 
V 2m m / J 

= (2^) V5(fc)5H + e(fc F - 9 ) eflfc + «| - fcp) . 27T* ( w - ^ - ^) . 
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For k,u ^ 0„ then, 



•"F 1 

S(k,uj) = ^ Jdqq 2 Jdx@(^/k 2 + q 2 + 2kqx - k F ) s(u - 



hk 2 hkq 
2m m 



o -1 



m 



2nhk 



m 
4nhk 



k jdu®(u + 2 -^-ki)®(u 



k muj 2 
2~~hk 



The constraints on u are 



k 2 > u > max! k 2 — 



2mw 



k muj 2 
2~~hk 



Clearly u > is required. There are two cases to consider. 
The first case is 



, 9 2muj 



k mtu 2 
2~ Jik 



0<uj< 



hk F k hk 2 

m 2m ' 



which in turn requires k < 2k F . In this case, we have 



m 2 uj 
2^Wk ' 



The second case 



, 9 2muo 
However, we also have that 



hence to is restricted to the range 

hk 



k moo 2 
2~lik 

k muj 
2 ~ ~h~k 



hk F k hk 2 
m 2m 



< k F , 



hk 



\k - 2k F \ < w < — \k + 2k F \ . 
2m 2m 



The integral in (3.201) then gives 



k muj 2 
2~~h~k 
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1 2 

cj/v F k F 

Figure 3.5: The dynamic structure factor S(k,u>) for the electron gas at various values of 
k/kp. 

Putting it all together, 

mk 



S(k,u) = < 



■* 2 ti 2 ' 2v F k 



mfc p 7rfc p 



ifO<ui<v F k-^ 



(j± fc_y 

\v F k 2k F J 



if 



v w k — 



hk 2 

2m 



<u<v Y k+^ 



(3.209) 



Integrating over all frequency gives the static structure factor, 

oo 

S{k) = ^{n{n k ) = j^S(k,u). 

— oo 

The result is 

S ( k ) = {n ifk>2k F 



(3.210) 



Vn 2 



(3.211) 



if k = , 



where n = k F /6ir is the density (per spin polarization). 
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3.8 Charged Systems: Screening and Dielectric Response 
3.8.1 Definition of the Charge Response Functions 

Consider a many-electron system in the presence of a time-varying external charge density 
p ext (r,t). The perturbing Hamiltonian is then 

Hi = -ejd\jd\' 

= ~ e j (2^)3 P h{k) ^ xt( " fc ' t] ■ (3 ' 212) 
The induced charge is — e5n, where Sn is the induced number density: 

Sh(q,u) = ^ x(q,w) PmdaM ■ (3.213) 
We can use this to determine the dielectric function e(q,to): 

V D = 47r PeJrt (3.214) 

V-E = 4ir( Pcxt -e(5n)) . (3.215) 

In Fourier space, 

iq-D(q,u>) = 4irp ext (q,<j) (3.216) 

iq-E(q,uj) =47rp ext (g, w) - 47re(5n(g, u)} , (3.217) 

so that from D(q,u) = e(q,uj) E(q,u>) follows 

1 _ iq-E(q,u) _ 1 Sh(q,u) (3 218) 

ext( 
2 

X(9,w) . (3.219) 



e(g,u;) iq-D(q,w) Zh cxt {q,uj) 

Aire 



A system is said to exhibit perfect screening if 



Q 2 



Aire 2 



e{q -► 0,w = 0) = oo => lim — ^- x(g, 0) = 1 . (3.220) 
Here, x(Qi w ) is the usual density-density response function, 

W ) = ^E 2 2 rV-^ K"l"fll°>| 2 > ( 3 - 221 ) 
ftl/ ^ u;4 — [LO + ze) 2 1X1 1/1 

where we content ourselves to work at T = 0, and where u no = uj n — lo is the excitation 
frequency for the state In). 
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From J charge = crE and the continuity equation 

i( l • 0'char g c(9^)) = -ieu{n(q,w)) = ia(q,u)q- E{q,u) , 



we find 



or 



( 47r /5 C xt(g>^) ~ ^e(5h(q,uj))^a(q,uj) = -iuje(5h(q,uj)) , 
(6h(q,u)) l-e _1 (qr,u/) 



w e-^p cxt (q,u;)-(5h(q,u J ))~ ^(qu) 



Thus, we arrive at 



e(q,uj) q l 



47TZ 

e(q, uj) = H cr(g, u;) 



(3.222) 
(3.223) 



e(g,w)-l. (3.224) 



(3.225) 



Taken together, these two equations allow us to relate the conductivity and the charge 
response function, 

iu e 2 x(q,u) 



(3.226) 



3.8.2 Static Screening: Thomas-Fermi Approximation 

Imagine a time-independent, slowly varying electrical potential <p{r). We may define the 
'local chemical potential' ju(r) as 



H = n{r) - ecj){r) , 



(3.227) 



where is the bulk chemical potential. The local chemical potential is related to the local 
density by local thermodynamics. At T = 0, 



K 1 .,, , ft 2 / n . ,„ , ,\2/3 



/'(>•) = = ^(3vr 2 n + 37r 2 5nW) 



2m 



hence, to lowest order, 



y 3 n 



(3.228) 



(3.229) 



This makes sense - a positive potential induces an increase in the local electron number 
density. In Fourier space, 



(6h(q, uj = 0)) = — (f>{q, uj = 0) 



(3.230) 
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Poisson's equation is — V 2 = 47rp tot , i.e. 

iq-E(q,0) = q 2 4>(q,0) 

= 4vrp cxt (q,0) - 4vre (<5n(g, 0)) 

= 4vr / 5 cxt (q,0)-^^(q,0) 



fi 



(3.231) 
(3.232) 



and denning the Thomas-Fermi wavevector q TF by 



Qtf — 



we have 



fcq,0) 



Gnne 2 
P 

q 2 + g 2 F 



hence 



e(<Jn(g,0)) 



9 2 + <7? F "' xl 



■Pfflrt(9.°) 



e(g,0) = l + 



?TF 



(3.233) 



(3.234) 



(3.235) 



Note that e(q — ► 0, w = 0) = oo, so there is perfect screening. 
The Thomas-Fermi wavelength is A XF = q^p, and may be written as 

7T \i/6 



A, 



/r^a B ~ 0.800 v/r^a B , 



(3.236) 



where r s is the dimensionless free electron sphere radius, in units of the Bohr radius a B = 
k 2 /me 2 = 0.529A, defined by |7r(r s a B ) 3 n = 1, hence r s oc n -1 / 3 . Small r s corresponds to 
high density. Since Thomas- Fermi theory is a statistical theory, it can only be valid if there 
are many particles within a sphere of radius A TP , i.e. 1 7rA 3 F n > 1, or r s <> (vr/12) 1 / 3 ~ 0.640. 
TF theory is applicable only in the high density limit. 

In the presence of a (5-function external charge density p ext (r) = Ze S(r), we have p ext (q, 0) = 
Ze and 



{6h(q,0)) 



q l + Qtf 



(5n(r)) 



Z e ~ r / X TF 
4irr 



Note the decay on the scale of A TF . Note also the perfect screening: 
e(<5n(q^0,u; = 0)> = /) cxt (g^0,^ = 0) = Ze . 



(3.237) 



(3.238) 



3.8.3 High Frequency Behavior of e(q,u) 



We have 



-if ^ r 47re2 r ^ 



(3.239) 
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where 



is the plasma frequency. 



m 



and, at T = 0, 

w ) = ^ElO'KI >r{ - 1 ~. ~T~ } , (3-240) 

where the number density operator is 

'Y^ ie iq r i (1 st quantized) 
h\ = I (3.241) 
, E fe 4 +q A ( 2nd quantized: {V fe , = 5 kk ,) . 

Taking the limit lo — > oo, we find 

oo 

X(9, - - oo) = - ^ £ I < j I nt | > | 2 w .„ = - ^ c/ S(g, a/) . (3.242) 

■7 — oo 

Invoking the /-sum rule, the above integral is nhq 2 /2m, hence 

2 

X (q,^^oo) = -^, (3.243) 

and 

,2 



e- 1 (qr,o;^oo) = l + ^| , (3.244) 



u p = ^ (3.245) 



3.8.4 Random Phase Approximation (RPA) 

The electron charge appears nowhere in the free electron gas response function x°(g,cj). 
An interacting electron gas certainly does know about electron charge, since the Coulomb 
repulsion between electrons is part of the Hamiltonian. The idea behind the RPA is to 
obtain an approximation to the interacting %(g,u;) from the noninteracting X°( ( li UJ ) by 
self-consistently adjusting the charge so that the perturbing charge density is not p ext (r), 
but rather p ext (r, t) — e (8n(r, t)). Thus, we write 



e(Sti{q,w)) = ^x RPA (g^)/S cxt (g^) (3.246) 



47re 2 



p-X°(9,«){^«t(9,w)-e<^(9,w)>} , (3.247) 
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Figure 3.6: Perturbation expansion for RPA susceptibility bubble. Each bare bubble con- 
tributes a factor yp{q,ijj) and each wavy interaction line v(q). The infinite series can be 
summed, yielding eqn. 3.249. 



which gives 



x RPA (<z^) 



x°(g>w) 



(3.248) 



Several comments are in order. 



1. If the electron-electron interaction were instead given by a general ■0(g) rather than 
the specific Coulomb form v(q) = 4-7re 2 /q 2 , we would obtain 



x RPA (q,") = 



X°(flf>w) 



l + v(q)X°(q,u) ' 

2. Within the RPA, there is perfect screening: 

j2J^x RPA (^) = i. 

3. The RPA expression may be expanded in an infinite series, 

x RPA = x -x ^ + x°*x ^ -.. 



(3.249) 



(3.250) 



(3.251) 



which has a diagrammatic interpretation, depicted in fig. 3.6. The perturbative 
expansion in the interaction v may be resummed to yield the RPA result. 



4. The RPA dielectric function takes the simple form 



4-tre 2 



e^\q,u) = l + ^ rX \q^) . 



(3.252) 
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5. Explicitly, 



Ree RPA (g,u;) = 1 + 



Qtf J 1 i kg 
^1 2 4g 



i _ (u - hq 2 /2m) 2 



(^) 5 



w — — hq 2 /2m 



uj + f F g — hq 2 /2m 



/ _( M -ltf/2m)» |||| 

V (^ F <?) 2 

' 2 
7ro; _ ?tf 



w — f F g + hq 2 /2m 



UJ + f F g + hq 2 /2m 
if < cj < v F c/ - hq 2 /2m 



(3.253) 



Trfep 
4q 



f (l - ^"^y ) if ^ - % 2 /2m < u, < ^ + % 2 /2m 



if u > v F q + hq 2 /2m 



6. Note that 



e RPA (q,w^oo) = l-^ 



in agreement with the /-sum rule, and 



e RPA (q^0,^ = 0) = l + % 

9 



in agreement with Thomas-Fermi theory. 
7. At oj = we have 



(3.254) 



(3.255) 



(3.256) 



e RPA (g,0)= l + ^ll + '"' 



q 2 I 2 2g V 4fc2 



In 



g + 2fc F 
2 



(3.257) 



which is real and which has a singularity at q 1 = 2k F . This means that the long-distance 
behavior of (<5n(r)) must oscillate. For a local charge perturbation, p cxt (r) = ZeS(r), 
we have 



(Sn(r)) 



2vr 2 r 



/dqqs'm(qr) < 1 — 



0) 



(3.258) 



and within the RPA one finds for long distances 

{5n{r)) -3 



(3.259) 



rather than the Yukawa form familiar from Thomas-Fermi theory. 
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3.8.5 Plasmons 



The RPA response function diverges when v(q) x°(q,<^) = — 1. For a given value of q, this 
occurs for a specific value (or for a discrete set of values) of u, i.e. it defines a dispersion 
relation u = Q(q). The poles of x RPA an d are identified with elementary excitations of the 
electron gas known as plasmons. 

To find the plasmon dispersion, we first derive a result for x°(q,u;), starting with 



X°(Q,t) = w ([h(q,t),n(-q,0)]) 



i 

ftV ' 



(3.260) 

= < [ E <W> E *l*> ^-,^] ) e*W*)-(*+«))'/ ft 9(f) , (3.261) 

fecr fc',cr' 

where e{k) is the noninteracting electron dispersion. For a free electron gas, e{k) = 
h 2 k 2 /2m. Next, using 

[AB, CD] = A{B, C}D - {A, C}BD + CA{B, D) - C{A, D}B (3.262) 

we obtain 

X°(9> = fir7 E(^ " W e^^*)" 6 ^))*/* 9(t) , (3.263) 



ka 



and therefore 



Here, 



X°(Q^) = 2 



Sk 



(27r) 3 foj - + q) + e{k) + ie 



fk+q fk 



fk = 



1 



e (e(k)-ix)/k B T + 1 



(3.264) 



(3.265) 



is the Fermi distribution. At T = 0, /& = 0(/c P — &), and for a; 3> v F (? we can expand 
X°(q,w) in powers of cj~ 2 , yielding 



x°(g^) 



37T 2 mw 
so the resonance condition becomes 

47re 2 



kl q 2 \ l + 2,fhk F q 



5 V mu 



+ ... 



(3.266) 



= 1 



LO 2 



1 + 1(^1 + 



5 I w 



This gives the dispersion 



) , , 3 ( v f<1 



+ ... 



(3.267) 



(3.268) 



Recall that the particle-hole continuum frequencies are bounded by uJ min (q) and to max (q), 
which are given in eqs. 3.190 and 3.189. Eventually the plasmon penetrates the particle-hole 
continuum, at which point it becomes heavily damped since it can decay into particle-hole 
excitations. 
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Chapter 4 

Magnetism 



4.1 References 

• N. Ashcroft and N. D. Mermin, Solid State Physics 

• R. M. White, Quantum Theory of Magnetism 

• A. Auerbach, Interacting Electrons and Quantum Magnetism 

• A. C. Hewson, The Kondo Problem to Heavy Fermions 



4.2 Introduction 

Magnetism arises from two sources. One is the classical magnetic moment due to a current 
density j: 

m = ^- jd\ r x j . (4.1) 
The other is the intrinsic spin S of a quantum-mechanical particle (typically the electron) : 

C[fi 

m = gn S/h ; fx = = magneton, (4.2) 

2mc 

where g is the g-factor (duh!). For the electron, q = — e and fi Q = —fi B , where fi B = eh/2mc 
is the Bohr magneton. 

The Hamiltonian for a single electron is 

where 7r = p + | A. Where did this come from? From the Dirac equation, 

.^d^ f mc 2 + V ca-ir \ T ^ T ,, 
ih— = , T J * = . (4.4) 

dt \ ca - TV -mc 2 + V y ' 
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The wavefunction ^ is a four-component Dirac spinor. Since mc 2 is the largest term for our 
applications, the upper two components of are essentially the positive energy components. 
However, the Dirac Hamiltonian mixes the upper two and lower two components of One 
can 'unmix' them by making a canonical transformation, 

H — ► Ti! = e iA Ti e~ iA , (4.5) 

where A is Hermitian, to render H! block diagonal. With E = mc 2 + e, the effective 
Hamiltonian is given by (4.3). This is known as the Foldy-Wouthuysen transformation, the 
details of which may be found in many standard books on relativistic quantum mechanics 
and quantum field theory {e.g. Bjorken and Drell, Itzykson and Zuber, etc.). Note that 
the Dirac equation leads to g = 2. If we go beyond "tree level" and allow for radiative 
corrections within QED, we obtain a perturbative expansion, 



<7 = 2 jl + - + 0(Oj , (4.6) 

where a = e 2 /he « 1/137 is the fine structure constant. 1 
There are two terms in (4.3) which involve the electron's spin: 
eh 

H z = rr ■ H (Zeeman term) (4.7) 

2mc 

Ti so = - — a " x (p + | A) (spin-orbit interaction) . (4.8) 
The numerical value for fj, B is 

eh 

fi B = = 5.788 x l(T 9 eV/G (4.9) 

H B /k B = 6.717 x 1(T 5 K/G . (4.10) 

So on the scale of electron volts, laboratory scale fields (H < 10 6 G) are rather small. (And 
~2000 times smaller for nucleons!). 

The thermodynamic magnetization density is defined through 

where F(T, V, H, N) is the Helmholtz free energy. The susceptibility is then 

, , _ 5(M a {r,t)) 6 2 F 
M r '*l r '*)- 5HP{r',t') ~~ 5H<*{r,t)5HP{r>,t>) ' ( } 

where F is replaced by a suitable generating function in the nonequilibrium case. Note that 
M has the dimensions of H. 

1 Note that with /i n = efi/2m p c for the nuclear magneton, g p = 2.793 and g n — —1.913. These results 
immediately suggest that there is composite structure to the nucleons, i.e. quarks. 
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4.2.1 Absence of Orbital Magnetism within Classical Physics 

It is amusing to note that classical statistical mechanics cannot account for orbital mag- 
netism. This is because the partition function is independent of the vector potential, which 
may be seen by simply shifting the origin of integration for the momentum p: 

y^ e -fm{UH-l«r t) ,r i} ) (4 . 13) 

/j/V jN 
±^_P e -mir^ = z{A = oh (4 . 14) 

Thus, the free energy must be independent of A and hence independent of H = V x A, and 
M = —dF/dH = 0. This inescapable result is known as the Bohr- von Leeuwen theorem. 
Of course, classical statistical mechanics can describe magnetism due to intrinsic spin, e.g. 

z Hciscnhcie (H) = n / ^r e/3JE<lJ> Mi e/w ' El (4 - 15) 

i 

Z lBins (H) = e /3JE ^> a%G] . (4.16) 

Theories of magnetism generally fall into two broad classes: localized and itinerant. In the 
localized picture, we imagine a set of individual local moments m; localized at different 
points in space (typically, though not exclusively, on lattice sites). In the itinerant picture, 
we focus on delocalized Bloch states which also carry electron spin. 



4.3 Basic Atomic Physics 

4.3.1 Single electron Hamiltonian 

We start with the single-electron Hamiltonian, 

n = ^(p+-A) 2 + V(r)+g^ B H . s /h+-^s-VV x (p + fA) . (4.17) 
2m \ c / 2m z c z 

For a single atom or ion in a crystal, let us initially neglect effects due to its neighbors. In 
that case the potential V{r) may be taken to be spherically symmetric, so with I = r x p, 
the first term in the spin-orbit part of the Hamiltonian becomes 

1 1 ldV 

so 2m 2 c 2 ^ 2m 2 c 2 r 9r ' ^ ^ 



with W = r(dV/dr). We adopt the gauge A = \H x r so that 
2m 



lm^?~^~ c ) 2m 2mc 8mc 2 ^ ^ ^ ^ 
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Finally, restoring the full SO term, we have 

The last term is usually negligible because rV'{r) is on the scale of electron volts, while 
mc 2 = 511keV. 2 The (H x r) 2 breaks the rotational symmetry of an isolated ion, so in 
principal we cannot describe states by total angular momentum J. However, this effect is of 
order H 2 , so if we only desire energies to order H 2 , we needn't perturb the wavefunctions 
themselves with this term, i.e. we can simply treat it within first order perturbation theory, 
leading to an energy shift g^?( ^ \ {H x r) 2 | *]/ ) in state | n ). 



4.3.2 The Darwin Term 

If V(r) = -Ze 2 jr, then from V 2 (l/r) = -47r5(r) we have 

which is centered at the nucleus. This leads to an energy shift for s-wave states, 

A^-»e = |^ |^(0)| 2 = \Za 2 a\ |^(0)| 2 • £ , (4.23) 
where a = €- ~ is the fine structure constant and a„ = ~ 0.529 A is the Bohr 

he IS" a me- 1 

radius. For large Z atoms and ions, the Darwin term contributes a significant contribution 
to the total energy. 



4.3.3 Many electron Hamiltonian 



The full TV-electron atomic Hamiltonian, for nuclear charge Ze, is then 



N 



i=i 



pj Ze 2 



2m r,- 



N 2 



N 



i<j 1 1 Jl i=l 



N ( 2 



1=1 



(4.24) 



where L = r,x p { and 



C(r) = 



Ze 2 1 Z fe 2 \ 



2\ 2 9 



= TTT ( ~, I 



2m 2 c 2 r 3 /i 2 \^ c / 2a 



(?) 3 ■ 



(4.25) 



2 Exercise: what happens in the case of high Z atoms? 
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The total orbital and spin angular momentum are L = ^ l i and S = ^ s { , respectively. 



The full many-electron atom is too difficult a problem to solve exactly. Generally progress 
is made by using the Hartree-Fock method to reduce the many-body problem to an effective 
one-body problem. One starts with the interacting Hamiltonian 



N 



PL 

2m 



Ze 2 



N 



I <Y* . T* ■ I ' 



(4.26) 



and treats 7i so as a perturbation, and writes the best possible single Slater determinant 
state: 

(4.27) 



V.*„( r i> • • • ' rjv) = (ri) • • • 



where A is the antisymmetrizer, and (f ia (r) is a single particle wavefunction. In second- 
quantized notation, the Hamiltonian is 



« = E T ^ A + E v w vi A' ^ t 



la ' 



(4.28) 



ija 



ijkl 
era' 



where 



2m 



Ze 2 



<Pja(r) 



The Hartree-Fock energy is given by a sum over occupied orbitals: 



(4.29) 
(4.30) 

(4.31) 



ijaa' 



The term is called the direct Coulomb, or "Hartree" term, and V^f, 6, is the exchange 
term. Introducing Lagrange multipliers £j CT to enforce normalization of the {(pi a (r)} and 
subsequently varying with respect to the wavefunctions yields the Hartree-Fock equations: 



5E V 



5<fiia(r 



= 



(4.32) 



(*|*)=i 

?ia <Pur(r) 



2m 

occ 



V 2 



Ze 2 



k>)+EV dV %^*>) 



(4.33) 



which is a set of iV coupled integro-differential equations. Multiplying by (f*(r) and inte- 
grating, we find 



f. = T°" + 9 \ ( V a - a ' - V- aa ' iS \ 

^ia ~ " / j \ * ijji r ijij u aa' J " 

ja' 



(4.34) 
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It is a good approximation to assume that the Hartree-Fock wavefunctions (p^r) are spher- 
ically symmetric, i.e. 

<p ia (r) = R nl (r)Y lm (9,4>) , (4.35) 

independent of a. We can then classify the single particle states by the quantum numbers 
n£ {1,2,...}, I € {0,1, ... ,n — 1}, m { G {—I, . . . , +/}, and m s = ±|. The essential physics 
introduced by the Hartree-Fock method is that of screening. Close to the origin, a given 
electron senses a potential —Ze 2 /r due to the unscreened nucleus. Farther away, though, 
the nuclear charge is screened by the core electrons, and the potential decays faster than 
1/r. (Within the Thomas-Fermi approximation, the potential at long distances decays as 
— Ce 2 a|/r 4 , where C ~ 100 is a numerical factor, independent of Z.) Whereas states of 
different I and identical n are degenerate for the noninteracting hydrogenic atom, when the 
nuclear potential is screened, states of different I are no longer degenerate. Smaller / means 
smaller energy, since these states are localized closer to the nucleus, where the potential 
is large and negative and relatively unscreened. Hence, for a given n, the smaller I states 
fill up first. For a given I and n there are (2s + 1) x (2/ + 1) =41 + 2 states, labeled by 
the angular momentum and spin polarization quantum numbers m l and m s ; this group of 
orbitals is called a shell. 

4.3.4 The Periodic Table 

Based on the energetics derived from Hartree-Fock 3 , we can start to build up the Periodic 
Table. (Here I follow the pellucid discussion in G. Baym's Lectures on Quantum Mechanics, 
chapter 20.) Start with the lowest energy states, the Is orbitals. Due to their lower angular 
momentum and concomitantly lower energy, the 2s states get filled before the 2p states. 
Filling the Is, 2s, and 2p shells brings us to Ne, whose configuration is (Is) 2 (2s) 2 (2p) 6 . 
Next comes the 3s and 3p shells, which hold eight more electrons, and bring us to Ar: 
Is 2 2s 2 2p 6 3s 2 3p 6 = [Ne] 3s 2 3p 6 , where the symbol [Ne] denotes the electronic configuration 
of neon. At this point, things start to get interesting. The 4s orbitals preempt the 3d 
orbitals, or at least most of the time. As we see from table 4.1, there are two anomalies 



The 3d transition metal series ([Ar] core additions) 


Element (A z ) 


Sc 21 


Ti 22 


V 23 


Cr 24 


Mn 25 


Configuration 


4s 2 3d 1 


4s 2 3d 2 


4s 2 3d 3 


4s 1 3d 5 


4s 2 3d 5 


Element (A z ) 


Fe 2e 


Co 27 


Ni 28 


Cu 2y 


Zn 3u 


Configuration 


4s 2 3d B 


4s 2 3d Y 


4s 2 3d« 


4s 1 3d 10 


4s 2 3d 10 



Table 4.1: Electronic configuration of 3d-series metals. 



in the otherwise orderly filling of the 3d shell. Chromium's configuration is [Ar] 4s 1 3d 5 
rather than the expected [Ar] 4s 2 3d 4 , and copper's is [Ar] 4s 1 3d 10 and not [Ar] 4s 2 3d 9 . In 

3 Hartree-Fock theory tends to overestimate ground state atomic energies by on the order of 1 eV per 
pair of electrons. The reason is that electron-electron correlations are not adequately represented in the 
Hartree-Fock many-body wavefunctions, which are single Slater determinants. 
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reality, the ground state is not a single Slater determinant and involves linear combinations 
of different configurations. But the largest weights are for Cr and Cu configurations with 
only one 4s electron. Zinc terminates the 3d series, after which we get orderly filling of the 
4p orbit als. 

Row five reiterates row four, with the filling of the 5s, 4d, and 5p shells. In row six, the 
lanthanide (4f) series interpolates between 6s and 5d (see also table 4.2), and the actinide 
(5f) series interpolates in row seven between 7s and 6d. 



Shell: 


Is 


2s 


2p 


3s 


3p 


4s 


3d 


4p 


5s 


Termination: 


He 2 


Be 4 


Ne iu 


Mg i2 


Ar l8 


Ca 2U 


Zn 3U 


Kr 36 


Sr 38 


Shell: 


4d 


5p 


6s 


4f 


5d 


6p 


7s 


5f/6d 




Termination: 


Cd 46 


Xe 54 


Ba 56 


Lu 71 


Hg 80 


Rn 86 


Ra 88 


No 102 





Table 4.2: Rough order in which shells of the Periodic Table are filled. 



4.3.5 Splitting of Configurations: Hund's Rules 

The electronic configuration does not uniquely specify a ground state. Consider, for exam- 
ple, carbon, whose configuration is Is 2 2s 2 2p 2 . The filled Is and 2s shells are inert. However, 
there are (2) = 15 possible ways to put two electrons in the 2p shell. It is convenient to 
label these states by total L, S, and J quantum numbers, where J = L + S is the total 
angular momentum. It is standard to abbreviate each such multiplet with the label 2S+1 Lj, 
where L = S, P, D, F, H, etc.. For carbon, the largest L value we can get is L = 2, which 
requires S = and hence J = L = 2. This 5-fold degenerate multiplet is then abbreviated 
X D 2 . But we can also add together two / = 1 states to get total angular momentum L = 1 
as well. The corresponding spatial wavefunction is antisymmetric, hence S = 1 in order to 
achieve an antisymmetric spin wavefunction. Since \L — S\ < J < \L + S\ we have J = 0, 
J = 1, or J = 2 corresponding to multiplets 3 P , 3 P l5 and 3 P 2 , with degeneracy 1, 3, and 
5, respectively. The final state has J = L = S = 0: 1 S . The Hilbert space is then spanned 
by two J = singlets, one J = 1 triplet, and two J = 2 quintuplets: 0©0©1©2©2. That 
makes 15 states. Which of these is the ground state? 

The ordering of the multiplets is determined by the famous Hund 's rules : 

1. The LS multiplet with the largest S has the lowest energy. 

2. If the largest value of S is associated with several multiplets, the multiplet with the 
largest L has the lowest energy. 

3. If an incomplete shell is not more than half-filled, then the lowest energy state has 
J = \L — S\. If the shell is more than half-filled, then J = L + S '. 



134 



CHAPTER 4. MAGNETISM 



4f series 




Figure 4.1: Variation of L, 5, and J among the 3d and 4f series. 



Hund's rules are largely empirical, but are supported by detailed atomic quantum many- 
body calculations. Basically, rule #1 prefers large 5 because this makes the spin part of the 
wavefunction maximally symmetric, which means that the spatial part is maximally anti- 
symmetric. Electrons, which repel each other, prefer to exist in a spatially antisymmetric 
state. As for rule ^2, large L expands the electron cloud somewhat, which also keeps the 
electrons away from each other. 

Let's practice Hund's rules on a couple of ions: 



• Mn 4+ : The electronic configuration [Ar] 4s 2 3d 3 has an unfilled 3d shell with three 
electrons. First maximize 5 by polarizing all spins parallel (up, say), yielding 5=1. 
Next maximize L consistent with Pauli exclusion, which says L = 2 + l + = 3. 
Finally, since the shell is less than half- filled, J = \L — S\ = |, and the ground state 
term is 4 F„ 



• Fe 2+ : The electronic configuration [Ar] 4s 2 3d 6 has an unfilled 3d shell with six elec- 
trons, or four holes. First maximize 5 by making the spins of the holes parallel, 
yielding 5 = 2. Next, maximize L consistent with Pauli exclusion, which says 
L = 2 + l + + (— 1) = 2 (adding L z for the four holes). Finally, the shell is 
more than half-filled, which means J = L + 5 = 4. The ground state term is 5 D 4 . 



Nd 3+ : The electronic configuration [Xe] 6s 2 4f 3 has an unfilled 4f shell with three 
electrons. First maximize 5 by making the electron spins parallel, yielding 5 = |. 
Next, maximize L consistent with Pauli exclusion: L = 3 + 2 + l = 6. Finally, since 
the shell is less than half-filled, we have J = \L — 5| = |. The ground state term is 
4 T 
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4.3.6 Spin-Orbit Interaction 

Hund's third rule derives from an analysis of the spin-orbit Hamiltonian, 

N 

n so = J2tt r i)h-s i • (4-36) 

i=i 

This commutes with J 2 , L 2 , and S 2 , so we can still classify eigenstates according to total 
J, L, and S. The Wigner-Eckart theorem then guarantees that within a given J multiplet, 
we can replace any tensor operator transforming as 

nT jM n] = Y, V MM>(<x,P,l)T JM , , (4.37) 

M> 

where TZ corresponds to a rotation through Euler angles a, (3, and 7, by a product of a 
reduced matrix element and a Clebsch-Gordon coefficient: 

< JM I T JHM „ I J'M> ) = C( K J M J M , ^1) <J||:Z>|| J*) . (4.38) 

In other words, if two tensor operators have the same rank, their matrix elements are 
proportional. Both 7i so and L ■ S are products of rank L = 1, S = 1 tensor operators, hence 
we may replace 

n so ^H so = AL-S , (4.39) 

where A = A(N, L, S) must be computed from, say, the expectation value of 7i so in the 
state I JLSJ). This requires detailed knowledge of the atomic many-body wavefunctions. 
However, once A is known, the multiplet splittings are easily obtained: 

H SQ = \A(J 2 -L 2 -S 2 ) 

= \h 2 A ( J( J + 1) - L(L + 1) - S{S + 1)) . (4.40) 

Thus, 

E(N, L, S, J) - E(N, L, S, J - 1) = A J fr 2 . (4.41) 

If we replace C( r i) by its average, then we can find A by the following argument. If the 
last shell is not more than half filled, then by Hund's first rule, the spins are all parallel. 
Thus S = 7^N and s i = S/N, whence A = (Q/2S. Finding (£) is somewhat tricky. For 
_£-i ^ r/a B <C 1, one can use the WKB method to obtain ip(r = a B /Z) ~ y/Z, whence 



and 

A ~ Z 2 a 2 h~ 2 Ry , (4.43) 

where a = e 2 /he ~ 1/137. For heavy atoms, Za ~ 1 and the energy is on the order of that 
for the outer electrons in the atom. 
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For shells which are more than half filled, we treat the problem in terms of the holes relative 
to the filled shell case. Since filled shells are inert, 

H«> = -T, i r s j > ( 4 - 44 ) 

where = 41 + 2 — N. I - and s- are the orbital and spin angular momenta of the holes; 

L = — J2j lj an d S = — J2j 8j- We then conclude A = —(Q/2S. Thus, we arrive at Hund's 
third rule, which says 

N < 2L + 1 (< half-filled) A > =► J = \L — S\ (4.45) 

N>2L + 1 (> half-filled) => A<0 J=\L + S\. (4.46) 

4.3.7 Crystal Field Splittings 

Consider an ion with a single d electron {e.g. Cr 3+ ) or a single d hole {e.g. Cu 2+ ) in a cubic 
or octahedral environment. The 5-fold degeneracy of the d levels is lifted by the crystal 
electric field. Suppose the atomic environment is octahedral, with anions at the vertices of 
the octahedron (typically 2 ~ ions). In order to minimize the Coulomb repulsion between 
the d electron and the neighboring anions, the d x2 _ y2 and d 3x2 _ r2 orbitals are energetically 

disfavored, and this doublet lies at higher energy than the {d xy ,d xz ,d yz } triplet. 
The crystal field potential is crudely estimated as 

(nbrs) 

^cf = J2 V ( r ~ R h (4-47) 
R 

where the sum is over neighboring ions, and V is the atomic potential. 

The angular dependence of the cubic crystal field states may be written as follows: 

= ^ 2 {r) + ^ 2 {r) 
d 3z 2_ r 2{r) =Y 2fi {r) 

d xy {r) = ^_ 2 {r) - ^ 2 {r) 

d yz {r) = ^l2,-i(r) - f 2 Y 2 ,i(r) ■ (4.48) 

Note that all of these wavefunctions are real. This means that the expectation value of 
L z , and hence of general L a , must vanish in any of these states. This is related to the 
phenomenon of orbital quenching , discussed below. 

If the internal Hund's rule exchange energy J H which enforces maximizing S is large com- 
pared with the ground state crystal field splitting A, then Hund's first rule is unaffected. 
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Figure 4.2: Effect on s, p, and <i levels of a cubic crystal field. 



However, there are examples of ions such as Co 4+ for which J H < Vcf- In such cases, the 
crystal field splitting wins and the ionic ground state is a low spin state. For Co 4+ in an 
octahedral crystal field, the five 3d electrons all pile into the lower 3-fold degenerate tig 



manifold, and the spin is S 



2' 



When the Hund's rule energy wins, the electrons all have 



parallel spin and S = §, which is the usual high spin state. 



4.4 Magnetic Susceptibility of Atomic and Ionic Systems 

To compute the susceptibility, we will need to know magnetic energies to order H 2 . This 
can be computed via perturbation theory. Treating the H = Hamiltonian as TCq, we have 

1 e 2 ^ 

E n (H) = E n (0) + -fi B H .( n \L + 2S\n) + — ^ (n\ ^(Jf x rf \n) 

i=l 

1 9 „ „ v-^ (n\L a + 2S a \n')(n' I + 25^ I n ) , „ 
+ -^H a H^Y, — En _ En , — + °( H ) > ( 449 ) 

where Z ion is the number of electrons on the ion or atom in question. Since the (H x rj 2 
Larmor term is already second order in the field, its contribution can be evaluated in 
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One-eledron energy diagrams 
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Figure 4.3: The splitting of one-electron states in different crystal field environments. 

first order perturbation theory, i.e. by taking its expectation value in the state | n). The 
(L-\-2S) ■ H term, which is linear in the field, is treated in second order perturbation theory. 



4.4.1 Filled Shells: Larmor Diamagnetism 

If the ground state | G ) is a singlet with J G ) = L G ) = S G) = 0, corresponding to a 
filled shell configuration, then the only contribution to the ground state energy shift is from 
the Larmor term, 



AE (H) 



e 2 H 2 
Ylrac 2 



(4.50) 



and the susceptibility is 



X 



N d 2 AE t 







V dH 2 



ne 

Qmc 2 



(4.51) 



i=l 



where n = N/V is the density of ions or atoms in question. The sum is over all the electrons 
in the ion or atom. Defining the mean square ionic radius as 



-i ^ion 

^ 2 ) = ^-< G l E^ 2 l G >' 



i=l 



we obtain 



X 



ne 

Qmc 2 



(e 2 \ 2 (r 2 ) 



(4.52) 



(4.53) 
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Note that X is dimensionless. One defines the molar susceptibility as 
x moiar = N ^ /n = -\Z ion N A al[^\{r / a B ) 2 ) 

= -7.91 x 10~ 7 Z ion ((r/a B ) 2 ^ cm 3 /mol . (4.54) 

Typically, {{r/a B ) 2 ) ~ 1. Note that with na% ~ 0.1, we have |X| < 10~ 5 and M = XH is 
much smaller than H itself. 



Molar Susceptibilities of Noble Gas Atoms and Alkali and Halide Ions 


Atom or 
Ion 


Molar 
Susceptibility 


Atom or 
Atom or Ion 


Molar 
Susceptibility 


Atom or 
Atom or Ion 


Molar 
Susceptibility 






He 


-1.9 


Li+ 


-0.7 


F" 


-9.4 


Ne 


-7.2 


Na+ 


-6.1 


cr 


-24.2 


Ar 


-19.4 


K+ 


-14.6 


Br _ 


-34.5 


Kr 


-28 


Rb+ 


-22.0 


I 


-50.6 


Xe 


-43 


Cs+ 


-35.1 



Table 4.3: Molar susceptibilities, in units of 10 6 cm 3 /mol, of noble gas atoms and alkali 
and halide ions. (See R. Kubo and R. Nagamiya, eds., Solid State Physics, McGraw-Hill, 
1969, p. 439.) 



4.4.2 Partially Filled Shells: van Vleck Paramagnetism 

There are two cases to consider here. The first is when J = 0, which occurs whenever the 
last shell is one electron short of being half-fille. Examples include Eu 3+ (41*), Cr 2+ (3d 4 ), 
Mn 3+ (3d 4 ), etc. In this case, the first order term vanishes in AE , and we have 

„ P 2 z i°n \(n\L z + 2S Z \G) 2 

* = -^(°lX>?|G) + 2n,gE |V ' ' - (4.55) 

i=l n^0 n 

The second term is positive, favoring alignment of M with H. This is called van Vleck 
paramagnetism, and competes with the Larmor diamagnetism. 

The second possibility is J > 0, which occurs in all cases except filled shells and shells which 
are one electron short of being half-filled. In this case, the first order term is usually domi- 
nant. We label the states by the eigenvalues of the commuting observables { J 2 , J z , L 2 , S 2 }. 
From the Wigner-Eckart theorem, we know that 

< JLSJ Z \L + 2S\ JLSJ' Z ) = J, L, S) < JLSJ Z \ J \ JLSJ' Z > , (4.56) 

where 

^>=t+ ^i;^ +1) 
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Figure 4.4: Reduced magnetization curves for three paramagnetic salts and comparison with 
Brillouin theory predictions. C(x) = Bj^ OQ (x) = ctnh (x) — x _1 is the Langevin function. 



is known as the Lande g-factor. Thus, the effective Hamiltonian is 

H cS = g^ B J-H/h. (4.58) 

The eigenvalues of 7i eS are E- = jjH, where j £ {— J, ...,+J} and 7 = <? L /U B . The 
problem is reduced to an elementary one in statistical mechanics. The partition function is 

Z = e" W = y e -nH/k B T = smh ((J + \)lH/k T) 

sinh (-yH/2k B T) y ' 

The magnetization density is 

M = -y^jj = n^JBjiJjH/k.T) , (4.60) 
where Bj(x) is the Brillouin function, 

Bj(x) = [l + ij) ctnh [(1 + ij)x]-^j ctnh (x/2J) . (4.61) 
The magnetic susceptibility is thus 

= ( Jg L ) 2 (na 3 B ) (e 2 /hc) 2 (j^) B f j(g» B JH/k B T) (4.62) 



X{H = ^T) = \{g^ B fn J -^±^. 



k B T 



3 1 

The inverse temperature dependence is known as Curie's law. 



(4.63) 



Does Curie's law work in solids? The 1/T dependence is very accurately reflected in insu- 
lating crystals containing transition metal and rare earth ions. We can fit the coefficient of 
the 1/T behavior by defining the 'magneton number' p according to 

2 

X(T) = n/4 ^ry • (4.64) 
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The theory above predicts 

p = g h y/j(J + l) . (4.65) 

One finds that the theory works well in the case of rare earth ions in solids. There, the 4f 
electrons of the rare earths are localized in the vicinity of the nucleus, and do not hybridize 
significantly with orbitals from neighboring ions. In transition metal compounds, however, 



Calculated and Measured Magneton Numbers of Rare Earth Ions 




Electronic 


Ground State 


magneton 


magneton 


Ion 


Configuration 


Term W+VLj 


^theory 


Pexpt 


La 3 + 


[Xe] 4f u 




0.00 


< 


Ce 3 + 


[Xe] 4f 1 


b 5/2 


2.54 


2.4 


p r 3+ 


[Xe] 4f 2 




3.58 


3.5 


Nd 3+ 


[Xe] 4f 3 


4 T 

i 9/2 


3.62 


3.5 


p m 3+ 


[Xe] 4f 4 


5 T 

M 


2.68 




Sm 3 + 


[Xe] 4f 5 


6H 5/2 


0.84 


1.5 


Eu 3 + 


[Xe] 4f 6 


7 Fo 


0.00 


3.4 


Gd 3 + 


[Xe] 4f 7 


D 7/2 


7.94 


8.0 


Tb 3 + 


[Xe] 4f 8 


% 


9.72 


9.5 


Dy 3 + 


[Xe] 4f 9 


Sir 

n 15/2 


10.63 


10.6 


Ho 3+ 


[Xe] 4f iU 


5 T 
L 8 


10.60 


10.4 


Er 3+ 


[Xe] 4f 11 


4 I 

i 15/2 


9.59 


9.5 


Tm 3+ 


[Xe] 4f 12 


3 H 


7.57 


7.3 


Yb 3 + 


[Xe] 4f 13 


b 7/2 


4.54 


4.5 


Lu 3 + 


[Xe] 4f 14 


la 


0.00 


< 



Table 4.4: Calculated and measured effective magneton numbers p for rare earth ions. 
(From N. W. Ashcroft and N. D. Mermin, Solid State Physics.) The discrepancy in the 
cases of Sm and Eu is due to the existence of low-lying multiplets above the ground state. 

one finds poor agreement except in the case of S states (L = 0). This is because crystal 
field effects quench the orbital angular momentum, effectively rendering L = 0. Indeed, as 
shown in Table 4.5, the theory can be rescued if one ignores the ground state terms obtained 
by Hund's rules, and instead takes L = and J = S, yielding g L = 2. 

4.5 Itinerant Magnetism of Noninteracting Systems 
4.5.1 Pauli Paramagnetism 

In a metal, the conduction electrons are delocalized and described by Block states. If we 
ignore the orbital effects of the magnetic field, we can easily compute the susceptibility at 
low fields and temperatures. At T = and H = 0, j and [ electrons fill respective Fermi 
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Calculated and Measured Magneton Numbers of Transition Metal Ions 


Ion 


Electronic 
Configuration 


Ground State 
Term ( 25+1 )L J 


magneton 
J=\L±S\ 


magneton 
J=S 

Lrii^ui y 


magneton 

^GXpt 


Ti 3 + 


[Ar] 3d 1 

L J 


2 D 

3/2 


1.55 


1.73 




v 4 + 


[Arl 3d 1 

L J 


2 D 

3/2 


1.55 


1.73 


1.8 


v 3 + 


[Ar] 3d 2 


L 2 


1.63 


2.83 


2.8 


V 2+ 


[Ar] 3d 3 


3/2 


0.77 


3.87 


3.8 


Cr 3+ 


[Ar] 3d 3 


4 F , 

L 3/2 


0.77 


3.87 


3.7 


Mn 4+ 


[Ar] 3d 3 


4 Fn/r, 


0.77 


3.87 


4.0 


Cr 2 + 


[Ar] 3d 4 


5 n 


0.00 


4.90 


4.8 


Mn 3+ 


[Ar] 3d 4 


5 n 

u 


0.00 


4.90 


5.0 


Mn 2 + 


[Ar] 3d 5 


6c 
^5/2 


5.92 


5.92 


5.9 


Fe 3 + 


[Ar] 3d 5 


6c 
^5/2 


5.92 


5.92 


5.9 


Fe 2 + 


[Ar] 3d 6 


5 n 

U 4 


6.70 


4.90 


5.4 


Co 2 + 


[Ar] 3d 7 


4F 9/2 


6.54 


3.87 


4.8 


Ni 2 + 


[Ar] 3d 8 


3 F 4 


5.59 


2.83 


3.2 


Cu 2 + 


[Ar] 3d y 


* D 5/2 


3.55 


1.73 


1.9 



Table 4.5: Calculated and measured effective magneton numbers p for transition metal 
ions. (From N. W. Ashcroft and N. D. Mermin, Solid State Physics.) Due to the orbital 
quenching, the angular momentum is effectively L = 0. 



seas out to wavevector k F . In an external field H, the Zeeman interaction splits the energies 
of the different polarization states: 

H z = /i B cr • H . (4.66) 
Taking H = Hz, and summing over all electrons, the Zeeman Hamiltonian becomes 

H z =fi B H(N^-N l ) , (4.67) 
and the magnetization density is (still at T = 0) 

Now since the energies of the j and j electrons are shifted by ±n B H, the change in their 
number is 

= -AA T = fi B H ■ \g{e F ) V , (4.69) 

where g(e F ) is the density of states per unit volume (including both spin species), at the 
Fermi energy. Putting this all together, we find 

M = filg(e F )H = X P H , (4.70) 
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where X P = m|^(£ f ) is the Pauli susceptibility. The Pauli susceptibility is positive, and 
hence is paramagnetic. 

Using the formula for the density of states, 

9ie r ) = ^ , (4-71) 

we find 

1 m* ( e 2 \ 2 

x - = ^™ (s) ( **->- (4 ' 72) 

Using e 2 /hc ~ 1/137.036 and assuming /c p 1, we find X p ~ 10 6 , which is comparable 
in magnitude (though opposite in sign) from the Larmor susceptibility of closed shells. 



4.5.2 Landau Diamagnetism 

Next, we investigate the orbital contribution. We assume a parabolic band, in which case 

H=-±;(p+iA) 2 + wH. (4.73) 

Appealing to the familiar results of a quantized charged particle in a uniform magnetic field, 
the energy levels are given by 

h 2 k 2 

e{n, k z , a) = (n + \) Huj c + afi B H + ^-f , (4.74) 

where uj c = eH/m*c is the cyclotron frequency. Note that [i B H = (— ) - ^hu> c . The three- 
dimensional density of states is a convolution of the two-dimensional density of states, 

1 



92d(e) = , (4.75) 



n=0 

where £ = ^Jhc/eH is the magnetic length, and the one- dimensional density of states, 

1 dk \fm? 1 , , . 

"^TT (4 ' 76) 

Thus, 

g(e) = ^- — T T 9(£ ~ £wg ) (4 77) 
Thus, the grand potential, 

oo 

Q(T, V, n, H) = -V k B T jdeg{e) lnjl + e ^- £ )/ fc B rj ( 4 . 78 ) 
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may be written as the sum, 



m*eH 



n(T, V, H) = -l^ T v__ E E ~ , 



n=0 a=±l 



with A = m* /m, 
and 



oo 

F( I /) = y"^ln{l + e ("- w )^} . 
o 

We now invoke the Euler-MacLaurin formula, 

oo „ 

E / w = /(*) + 1/(0) - n /'(°) + rfe no) 



(4.79) 

(4.80) 
(4.81) 



+ ... 



(4.82) 



n=0 



which gives 

VKTm* 3/2 



n = — 



2V2 



*0/Z I P 

E| JdeF(e) + ^ c F(^) + ^(hv c ) 2 F'(^) + ... 



(4.83) 



We now sum over a and perform a Taylor expansion in frw c oc i7, yielding 



n(T,V,fi,H) 



Vk B Tm* 3/2 



W + 1 (a 2 - s) (^c) 2 + o(^ 4 ) 



a 2 



1 + 2 (! " 3*0 ^H) 2 + °(^ 4 ) I V(T, V, fj,, 0) 



Thus, 



5^2 



H=0 



and the zero field magnetic susceptibility is 



X 



dn 



(4.84) 



(4.85) 



(4.86) 



The quantity X P = fi B (dn/dfi) is simply the finite temperature Pauli susceptibility. The 
orbital contribution is negative, i.e. diamagnetic. Thus, X = X P + X L , where 



^ = -Um/m*) 2 X T 



(4.87) 



is the Landau diamagnetic susceptibility. For free electrons, A = m/m* = 1 and X L = 
— \x p resulting in a reduced - but still paramagnetic - total susceptibility. However, in 
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semiconductors it is common to find m* ~ 0.1m, in which case the Landau diamagnetic 
term overwhelms the Pauli paramagnetic term, and the overall susceptibility is negative. 

In order to probe X P without the diamagnetic X L contribution, nuclear magnetic resonance 
(NMR) is used. NMR measures the effect of electron spins on the nuclear spins. The two 
are coupled via the hyperfine interaction, 

n M = 2 ^ r f N |x ■ i - s ■ i + (f • s)(r ■ i) I + 167r ^B^ s . j §{r) (488) 

where g N is the nuclear (/-value, i" is the nuclear total angular momentum, and fi N is the 
nuclear magneton. 

4.6 Moment Formation in Interacting Itinerant Systems 
4.6.1 The Hubbard Model 

A noninteracting electron gas exhibits paramagnetism or diamagnetism, depending on the 
sign of X, but never develops a spontaneous magnetic moment: M(H = 0) = 0. What gives 
rise to magnetism in solids? Overwhelmingly, the answer is that Coulomb repulsion between 
electrons is responsible for magnetism, in those instances in which magnetism arises. At 
first thought this might seem odd, since the Coulomb interaction is spin-independent. How 
then can it lead to a spontaneous magnetic moment? 

To understand how Coulomb repulsion leads to magnetism, it is useful to consider a model 
interacting system, described by the Hamiltonian 

H = ~ \ E % ( C *W + 4 C -) +U ^Z U i\ n il + t*B H • E 4 °afl C */3 • ( 4 - 89 ) 

This is none other than the famous Hubbard model, which has served as a kind of Rosetta 
stone for interacting electron systems. The first term describes hopping of electrons along 
the links of some regular lattice {{ij) denotes a link between sites i and j). The second 
term describes the local (on-site) repulsion of electrons. This is a single orbital model, so 
the repulsion exists when one tries to put two electrons in the orbital, with opposite spin 
polarization. Typically the Hubbard U parameter is on the order of electron volts. The last 
term is the Zeeman interaction of the electron spins with an external magnetic field. Orbital 
effects can be modeled by associating a phase exp(iA ri j) to the hopping matrix element t 

between sites i and j, where the directed sum of A- around a plaquette yields the total 
magnetic flux through the plaquette in units of 4> = hc/e. We will ignore orbital effects 
here. Note that the interaction term is short-ranged, whereas the Coulomb interaction falls 
off as 1/| Ri — Rj\. The Hubbard model is thus unrealistic, although screening effects in 
metals do effectively render the interaction to be short-ranged. 

Within the Hubbard model, the interaction term is local and written as Un^n^ on any given 
site. This term favors a local moment. This is because the chemical potential will fix the 
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mean value of the total occupancy + in which case it always pays to maximize the 



difference \n^ — raj 



4.6.2 Stoner Mean Field Theory 

There are no general methods available to solve for even the ground state of an interacting 
many-body Hamiltonian. We'll solve this problem using a mean field theory due to Stoner. 
The idea is to write the occupancy n icr as a sum of average and fluctuating terms: 

n la = (n ia ) + Sn ia . (4.90) 

Here, (n ic ) is the thermodynamic average; the above equation may then be taken as a defi- 
nition of the fluctuating piece, Sn ia . We assume that the average is site-independent. This 
is a significant assumption, for while we understand why each site should favor developing 
a moment, it is not clear that all these local moments should want to line up parallel to 
each other. Indeed, on a bipartite lattice, it is possible that the individual local moments 
on neighboring sites will be antiparallel, corresponding to an antiferromagnetic order of the 
pins. Our mean field theory will be one for ferromagnetic states. 

We now write the interaction term as 

(fluct 6 ) 2 

, * V 

n i\ n ii = ( n t^ ( n |) + ( n t^ ^il + ( n l^ ^ n it + ^ n il (4.91) 

= -<" T > ( n i) + (™ T ) n a + K) n i\ + °(( 5n ) 2 ) 

= \(m 2 - n 2 ) + \n (n iT + n^) + \m (n- ? - n^) + 0((5n) 2 ) , 

where n and m are the average occupancy per spin and average spin polarization, each per 
unit cell: 

n = (n ; ) + (n T ) (4.92) 
m = (n ; ) - (n T ) , (4.93) 

i.e. (n a ) = \{n — am). The mean field grand canonical Hamiltonian K, = Tt — /jj\f, may 
then be written as 



C la C ia 



^ MF = -\ E h 3 (4s> + C U«) - 0* - §H E • 

+ (/x B tf + \Um) Y, * 4a c ia + i^sitcs U(m 2 - n 2 ) , (4.94) 



where we've quantized spins along the direction of H, defined as z. You should take note 
of two things here. First, the chemical potential is shifted downward (or the electron ener- 
gies shifted upward) by an amount ^Un, corresponding to the average energy of repulsion 
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with the background. Second, the effective magnetic field has been shifted by an amount 
^Um/fi B , so the effective field is 

H cS = H+^. (4.95) 
The bare single particle dispersions are given by e a (k) = —i(k) + a/j, B H, where 

t(k) = j2 t ( R ) e ~ ikR > ( 4 - 96 ) 

R 

and = tiRj^ — Rj). Including the mean field effects, the effective single particle dispersions 
become 

e a {k) = -i{k) - \Un + (fi B H + \Um) a . (4.97) 

We now solve the mean field theory, by obtaining the free energy per site, </?(n, T, H). First, 
note that <p> = cj + 2 fin, where to = £l/N s i tes is the Landau, or grand canonical, free energy 
per site. This follows from the general relation Q = F — /xN; note that the total electron 
number is N = nN s i tes , since n is the electron number per unit cell (including both spin 
species). If g(e) is the density of states per unit cell (rather than per unit volume), then we 
have 4 



oo 



p = \U{m 2 + n 2 ) + fin - \k B Tjdeg{e) j In (l + e (P-e-A)/k B T^ + ln ^ + e (n-e+A)/k B T^ j 

— oo 

(4.98) 

where fx = fi — Un and A = fi B H+^Um. From this free energy we derive two self-consistent 
equations for /i and m. The first comes from demanding that ip be a function of n and not 
of fi, i.e. dip/dji = 0, which leads to 



■oo 

1 

n 



oo 

\ J deg(e) |/(e — A — p,) + f(e + A — /x) j , (4.99) 



2 

-■oo 



where f(y) = [exp(y/k B T) + l] 1 is the Fermi function. The second equation comes from 
minimizing / with respect to average moment m: 



oo 



in 



= I Jde g(e){f(e- A- p)-f(e + A- £)} . (4.100) 



-oo 



Here, we will solve the first equation, eq. 4.99, and use the results to generate a Landau 
expansion of the free energy ip in powers of m 2 . We assume that A is small, in which case 
we may write 



n 



oo 

Jdeg(e) [f(e - fi) + \A 2 f\e - p) + £ A 4 f""(e -£) + ...}. (4.101) 



'Note that we have written fin — /in + ^Un 2 , which explains the sign of the coefficient of n 2 
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We write /u(A) = fj, + <5/x and expand in 5//. Since n is fixed in our (canonical) ensemble, 
we have 

oo 

n= Jdeg(s)f{e-fi ) , (4.102) 

— oo 

which defines fi (n,T). 5 The remaining terms in the 5fi expansion of eqn. 4.101 must sum 
to zero. This yields 

£>(Ao) Sfi + lA 2 D'(fi ) + \{5ji) 2 D'(ft ) + |D"(/2o) A 2 ^ + £ lT(/2 ) A 4 + 0(A 6 ) = , 

(4.103) 

where 

oo 

D(v) = -Jdeg(e)f'(e-(i) (4.104) 

— oo 

is the thermally averaged bare density of states at energy [i. Note that the k th derivative is 

oo 

£ (fc V) = " JdegW(e)f'(e-n) . (4.105) 

— oo 

Solving for Sfi, we obtain 

5ft = _i 0l A 2 - ^(3af- 60^2 + 03) A 4 + C(A 6 ) , (4.106) 

where 

«* - ^¥ • (4.107) 

After integrating by parts and inserting this result for 5fi into our expression for the free 
energy /, we obtain the expansion 



<p(n, T, m) = <p (n, T) + \Um 2 - \D{ji ) A 2 + § ( " 3 ^'(Ao) J A 4 + . 

(4.108) 



£>(£o) 

where prime denotes differentiation with respect to argument, at m = 0, and 

00 

¥> (n, T) = i[/n 2 + nft - J de T(e) f(e- £ ) , (4.109) 

—00 

where g{e) = r'(e), so T(e) is the integrated bare density of states per unit cell in the 
absence of any magnetic field (including both spin species). 



5 The Gibbs-Duhem relation guarantees that such an equation of state exists, relating any three intensive 
thermodynamic quantities. 
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We assume that H and m are small, in which case 



11 K 

V = ip + \am 2 + \bm A - \x Q H 2 - Hm+... , (4.110) 
where X = /i|.D(/2 ) is the Pauli susceptibility, and 

a=\U(l-\UD) , 14^-1^ , (4.111) 

where the argument of each L>( fc ) above is fi (n,T). The magnetization density (per unit 
cell, rather than per unit volume) is given by 

M -M^ H +wr- <4 - 112) 

Minimizing with respect to m yields 

r/y 

am + 6m 3 & H = , (4.113) 

which gives, for small m, 



m= Xv H 

M B 1 - |?7D 



We therefore obtain M = XH with 



where 



X=TTV' ( 4 - 115 ) 

1 t/c 



The denominator of X increases the susceptibility above the bare Pauli value X , and is 
referred to as - I kid you not - the Stoner enhancement. 

It is worth emphasizing that the magnetization per unit cell is given by 

This is an operator identity and is valid for any value of m, and not only small m. 

When H = we can still get a magnetic moment, provided U > U c . This is a consequence 
of the simple Landau theory we have derived. Solving for m when H = gives m = when 
U <U C and 

,1/2 

m(£/) 



(2R4) (4 - 118) 



when U > U c , and assuming 6 > 0. Thus we have the usual mean field order parameter 
exponent of (3 = \. 
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4.6.3 Antiferromagnetic Solution 

In addition to ferromagnetism, there may be other ordered states which solve the mean field 
theory. One such example is antiferromagnetism. On a bipartite lattice, the antiferromag- 
netic mean field theory is obtained from 

(n ia ) = \n+\a e iQ R > m , (4.119) 

where Q = (ir/a, ir/a, . . . , it /a) is the antiferromagnetic ordering wavevector. The grand 
canonical Hamiltonian is then 



- \Um £ e iQ R * a c\ c c ic + \N sitcs U(m 2 - n 2 ) 

ia 

i sr^ ( f f \ ( £(k) — fi + 2?7n ^<r ?7to 

ko V 2 V , - , . 

+ lN sitcs U(m 2 - n 2 ) , (4.120) 



2° U ln \ | C fc,<r 



where e(k) = —i(k), as before. On a bipartite lattice, with nearest neighbor hopping only, 
we have s(k + Q) = —e(k). The above matrix is diagonalized by a unitary transformation, 
yielding the eigenvalues 

X = ±n e(k) -fl (4.121) 
n £ = Ve 2 + A 2 , (4.122) 

with A = \\Jm and p, = [i — \Un as before. The free energy per unit cell is then 

oo 

„ _ + „>) + m _ kKr j*M { _ (1 + +1 » (1 + .O^VV) } . 

— oo 

(4.123) 



The mean field equations are then 

oo 

\ Jdeg(e) {/( - Q £ - p) + f(Q £ - fi) } (4.124) 

-oo 
oo 

I? { /( " " e " ^ " ^ " (4 - 125) 



n- 2 



17 5 



As in the case of the ferromagnet, a paramagnetic solution with m = always exists, in 
which case the second of the above equations is no longer valid. 
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4.6.4 Mean Field Phase Diagram of the Hubbard Model 

Let us compare the mean field theories for the ferromagnetic and antiferromagnetic states 
at T = and H = 0.. Due to particle- hole symmetry, we may assume < n < 1 without 
loss of generality. (The solutions repeat themselves under n — > 2 — n.) For the paramagnet, 
we have 



n = 

— oo 



jdeg(s) (4.126) 

-oo 

<p = \Un 2 + jdeg{e)e , (4.127) 

— oo 

with p, = fi — \Un is the 'renormalized' Fermi energy and g(e) is the density of states per 
unit cell in the absence of any explicit (H) or implicit (m) symmetry breaking, including 
both spin polarizations. 

For the ferromagnet, 

//-A 

n 



4A 

IT 



= \fdeg{e) + \jdeg{e) (4.128) 

— oo — oo 

fl+A 

= Jdeg(e) (4.129) 



ft- A 



p-A p,+A 



ip=lUn 2 ~ jjr + Jdeg(e)e + Jdeg(e)e . (4.130) 



Here, A = \Um is nonzero in the ordered phase. 
Finally, the antiferromagnetic mean field equations are 



n = dsg(e) (p, < 0) (4.131) 



Jdeg(e) (jl>0) (4.132) 



oo 

2 



oo 



l=\de 9{£) (4.133) 



U J Ve 2 + A 2 

oo 

a r 

V = \Un 2 + deg{e) V ' e 2 + A 2 , (4.134) 
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Figure 4.5: Mean field phase diagram of the Hubbard model, including paramagnetic (P), 
ferromagnetic (F), and antiferromagnetic (A) phases. Left panel: results using a semicir- 
cular density of states function of half-bandwidth W. Right panel: results using a two- 
dimensional square lattice density of states with nearest neighbor hopping t, from J. E. 
Hirsch, Phys. Rev. B 31, 4403 (1985). The phase boundary between F and A phases is 
first order. 



where e = \/ fJ. 2 — A 2 and A = hllm as before. Note that \fj,\ > A for these solutions. 
Exactly at half-filling, we have n = 1 and p, = 0. We then set e = 0. 

The paramagnet to ferromagnet transition may be first or second order, depending on the 
details of g(s). If second order, it occurs at = l/g(jl p ), where p> P (n) is the paramagnetic 
solution for p,. The paramagnet to antiferromagnet transition is always second order in 
this mean field theory, since the RHS of eqn. (4.133) is a monotonic function of A. This 

/too 
Jde g{e) e~ l . Note that U£ — ► logarithmically for n — > 1, 

since £l p = at half-filling. 

For large U, the ferromagnetic solution always has the lowest energy, and therefore if U£ < 
Uc, there will be a first-order antiferromagnet to ferromagnet transition at some value 
U* > U^. In fig. 4.5, I plot the phase diagram obtained by solving the mean field equations 
assuming a semicircular density of states g(e) = - W~ 2 ^/W 2 — e 1 . Also shown is the phase 
diagram for the d = 2 square lattice Hubbard model obtained by J. Hirsch (1985). 

How well does Stoner theory describe the physics of the Hubbard model? Quantum Monte 
Carlo calculations by J. Hirsch (1985) showed that the actual phase diagram of the d = 
2 square lattice Hubbard Model exhibits no ferromagnetism for any n up to U = 10. 
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Furthermore, the antiferromagnetic phase is entirely confined to the vertical line n = 1. For 
and < U < 10, the system is a paramagnet 6 . In order to achieve a ferromagnetic 
solution, it appears necessary to introduce geometric frustration, either by including a next- 
nearest-neighbor hopping amplitude t' or by defining the model on non-bipartite lattices. 
Numerical work by M. Ulmke (1997) showed the existence of a ferromagnetic phase at T = 
on the FCC lattice Hubbard model for U = 6 and n € [0.15, 0.87] (approximately). 



4.7 Interaction of Local Moments: the Heisenberg Model 

While it is true that electronshave magnetic dipole moments, the corresponding dipole- 
dipole interactions in solids are usually negligible. This is easily seen by estimating the 
energy scale of the dipole-dipole interaction: 

E d _ d = m 1 -m 2 -3(m l .n)(m 2 .n) (4 ^ 

\ r l - r 2\ 

where h = (r 2 — r 1 )/\r 2 — r 1 | is the direction vector pointing from r x to r 2 . Substituting 
m = — fi B a, we estimate i?d-d as 

and with R ~ 2.5A we obtain E'd-d — 1 MeV, which is tiny on the scale of electronic energies 
in solids. The dominant magnetic coupling comes from the Coulomb interaction. 



4.7.1 Ferromagnetic Exchange of Orthogonal Orbitals 

In the Wannier basis, we may write the Coulomb interaction as 

v = \ E E( fi i^l F ^|^4 J R 3 >4 1CT 4 2 .'^ 3 .'^ 4(7 , (4.137) 



-R 3 ,-R 4 



where we have assumed a single energy band. The Coulomb matrix element is 

(fii^l | 7 ^|i? 4 ^3> = jd\Jd\'^\r-R 1 )^\r'-R 2 )^^^{r'-R z )ip{r-R A ) . 

(4.138) 

Due to overlap factors, the matrix element will be small unless i?2 = R3 and R± = R4, in 
which case we obtain the direct Coulomb interaction, 

e 2 

V(R -R') = {RR'\ I RR' ) 



Jd 3 rjdV \ip(r - R)\ 2 * 2 \<p(r'-R')\ 2 . (4.139) 



6 A theorem due to Nagaoka establishes that the ground state is ferromagnetic for the case of a single 
hole in the U — 00 system on bipartite lattices. 
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The direct interaction decays as \R — R'\ 1 at large separations. Were we to include only 
these matrix elements, the second quantized form of the Coulomb interaction would be 

^direct = \ E V ( R ~ R ^ ( n Ro n R'o> - 6 RR> 5 ™> n R°) 

RR' 

= Y. V ^ n R^ n Ri + \ E V(R-R')n R n R , , (4.140) 

R R^R' 

where n R = n R ^ + n R ^. The first term is the on-site Hubbard repulsion; one abbreviates 
U = V(0). 

A second class of matrix elements can be identified: those with Ri = R% = R and R2 = 
R4 = R', with R / R'. These are the so-called exchange integrals: 

e 2 

J(R-R') = (RR'\-———\R'R) 

= Jd 3 rjd 3 r' tp*(r - R) ip*(r' - R') ^ <p(r' - R) <p{r - R) 
= Jd 3 rjd 3 r'tp*(r)<p(r + R-R') - e * — <p*(r' + R - R) <p(r') ■ (4.141) 
Note that J(R — R) is real. The exchange part of V is then 

Exchange = 41^" ^ C L C R«> C *V C KV ( 4 - 142 ) 
R^R> 

oV 

= -i E J ( R ~ R ') { n R n R' + a R ■ a R>) ■ ( 4 - 143 ) 
R^R' 

The n R n R , piece can be lumped with the direct density-density interaction. What is new 
is the Heisenberg interaction, 

^Heis = - E J ( R ~ R ') S R ■ S R' ■ ( 4144 ) 
R^R' 

J(R — R) is usually positive, and this gives us an explanation of Hund's first rule, which 
says to maximize S. This raises an interesting point, because we know that the ground 
state spatial wavefunction for the general two-body Hamiltonian 

n = "^(V? + V 2 ,) + V(\ ri - r 2 \) (4.145) 

is nodeless. Thus, for fermions, the ground state spin wavefunction is an antisymmetric 
singlet state, corresponding to S = 0. Yet the V 3+ ion, with electronic configuration 
[Ar] 3d 2 , has a triplet S = 1 ground state, according to Hund's first rule. Why don't the 
two 3d electrons have a singlet ground state, as the 'no nodes theorem' would seem to 
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imply? The answer must have to do with the presence of the core electrons. Two electrons 
in the Is shell do have a singlet ground state - indeed that is the only possibility. But 
the two 3d electrons in V 3+ arc not independent, but must be orthogonalized to the core 
states. This in effect projects out certain parts of the wavefunction, rendering the no nodes 
theorem inapplicable. 

4.7.2 Heitler-London Theory of the H 2 Molecule 



The Hamiltonian for the H 2 molecule is 



H = 



2m 



+ 



ki - R J 

„2 



~ R b\ 



2m 



\ r 2~ R b\ 



\ r 2 ~ R a 



+ 



(4.146) 



The total wavefunction is antisymmetric: ^f(r 1 a 1 , r 2 a 2 ) = —^(r 2 a 2 , f^Oi). The N = 2 
electron case is special because the wavefunction factorizes into a product: 



^(r l a l ,r 2 a 2 ) = ^{r 1 ,r 2 )x{cF 1 ,a 2 ) . 



(4.147) 



The spin wavefunction may either be symmetric (triplet, S = 1), or antisymmetric (singlet, 
S = 0): 

'|TT> s = i 



7s(|U> + UT>) S = i 

I a) s = i 



(4.148) 



^(|U>-|1T>) s = o. 

A symmetric spin wavefunction requires an antisymmetric spatial one, and vice versa. 

Despite the fact that H does not explicitly depend on spin, the effective low-energy Hamil- 
tonian for this system is 

H cS = K + J5i • S 2 ■ (4.149) 



The singlet-triplet splitting is AE = E 



5=0 



E 



5=1 



-J, so if J > the ground state is 



the singlet, and if J < the ground state is the three-fold degenerate triplet. 
The one-electron Is eigenfunction ip(r) satisfies the following eigenvalue equation: 



2m 



V 2 - 



Y>(r) = e (r)Y>(r) . 



(4.150) 



In the Heitler-London approach, we write the two-electron wavefunction as a linear combi- 
nation 

$(r 1 ,r 2 ) = a$ I (r 1 ,r 2 ) + /3$„(r 1 ,r 2 ) , (4.151) 
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with 

*i(* , i»* , 2 ) = ^( r l " R a) V^ 2 - R h ) = ^a(^l) V- b (r 2 ) (4.152) 
*n(ri,r 2 ) = V(n - R h ) ^{r 2 - R a ) = ^(rj ^ a (r 2 ) . (4.153) 

Assuming the atomic orbital ^(r) to be normalized, we define the following integrals: 



f e 2 


e 2 


e 2 


e 2N 






r ib 


r 2a' 


V^ab 


e 2 


e 2 


e 2 


r 12 


r la 


r 2b 



A = Jd^r^Mr) (4-154) 
X = fd\ lUr^l 2 (^ + — - — -— ) (4.155) 

Y= /^ 1 /dV 2 ^(r 1 ,r 2 )* n (r 1 ,r 2 ) + — - — - — ) , (4.156) 

^ ^ ^ab r 12 r lb r 2a^ 

with r la = r 1 — i? a , etc. The expectation value of H in the state $ is 

< $ | H | $ > = (H 2 + |/3| 2 ) (2e + X) + (a*/? + /Fa) (2e|A| 2 + Y) , (4.157) 
and the self-overlap is 

( $ | $ ) = \a\ 2 + |/?| 2 + |A| 2 (a*(3 + /Fa) . (4.158) 

We now minimize (TC) subject to the condition that $ be normalized, using a Lagrange 
multiplier E to impose the normalization. Extremizing with respect to a* and (5* yields 

(^^xr/)®-U'r)(;). <-> 

and extremizing with respect to E yields the normalization condition 

H 2 + \I3\ 2 + |A| 2 (a* 13 + /Fa) = 1 . (4.160) 

The solutions are symmetric and antisymmetric states, with (3/a = ±1, corresponding to 
the energies 

E ± = 2£ o + ^^p- (4-161) 

Note that E, is the energy of the spatially symmetric state, which means a spin singlet 
while E_ corresponds to the spatially antisymmetric spin triplet. 

The singlet-triplet splitting is 
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If J > 0, the triplet lies higher than the singlet, which says the ground state is antiferro- 
magnetic. If J < 0, the triplet lies lower, and the ground state is ferromagnetic. The energy 
difference is largely determined by the Y integral: 



e 2 e 2 



+ 

^ab r 12 



Y = jd\jd\ 2 T*{ ri )T{r 2 ) 

-2A*fd\r,(r)-^-—A(r) , (4.163) 
J \r- R h \ 

with T(r) = Va( r ) ip h (r). The first term is positive definite for the Coulomb interaction. 
The second term competes with the first if the overlap is considerable. The moral of the 
story now emerges: 

weak overlap ==> ferromagnetism ( J < 0) 
strong overlap =>■ antiferromagnetism ( J > 0) . 

One finds that the H2 molecule is indeed bound in the singlet state - the total energy has 
a minimum as a function of the separation \R a — R h \. In the triplet state, the molecule is 
unbound. 



4.7.3 Failure of Heitler-London Theory 



At large separations R = \R a — R b \ the Heitler-London method describes two H atoms 
with tiny overlap of the electronic wavefunctions. But this tiny overlap is what determines 
whether the ground state is a total spin singlet or triplet (we ignore coupling to the nuclear 
spin). Sugiura obtained the following expression for the singlet-triplet splitting in the 
R — ► 00 limit: 

(£)'(£) 

where 7 = 0.577... is the Euler constant and where ip(r) = (7ra|)~ 1//2 exp(— r/a H ) is the 
hydrogenic wavefunction. This is negative for sufficiently large separations (R > 50 a H ). 
But this is a problem, since the eigenvalue problem is a Sturm-Liouville problem, hence the 
lowest energy eigenfunction must be spatially symmetric - the singlet state must always lie 
at lower energy than the triplet. The problem here is that Heitler-London theory does a 
good job on the wavefunction where it is large, i.e. in the vicinity of the protons, but a 
lousy job in the overlap region. 



J(R) 



56 
45 



&7- 



Is" ln 



R 



4.7.4 Herring's approach 

Conyers Herring was the first to elucidate the failure of Heitler-London theory at large 
separations. He also showed how to properly derive a Heisenberg model for a lattice of 
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hydrogenic orbitals. Herring started with the symmetric spatial wavefunction 

N 



*(r 1 ,...,r JV ) = n^-fi i ). (4.165) 

i=i 

This wavefunction would be appropriate were the electrons distinguishable. If we permute 
the electron coordinates using a spatial permutation P space £ S N , we obtain another wave- 
function of the same energy, E . However, there will be an overlap between these states: 

Jp = {*\H-E \P r V). (4.166) 

The effective Hamiltonian is then 

H cS = E + Y< J r P r- ( 4 - 167 ) 

Pes N 

A complete permutation P is a product of spatial and spin permutations: P = P r . P a , and 
the product when acting on an electronic wavefunction is (—l) p , which is +1 for an even 
permutation and ( — 1) for an odd one 7 . Thus, 

W cS = E + Yl i-lfjpP*- (4-168) 

The spin permutation operators P a may be written in terms of the Pauli spin matrices, 
once we note that the two-cycle (ij) may be written 

p (ij) = h + h°i-<rj • ( 4 - 169 ) 
Thus, accounting for only two-cycles, we have 

Ks = E -\Yl Jij (1 + *i ■ (Tj) • (4.170) 

For three-cycles, we have 

P (ijk) = P (ik) P (jk) 

= \ l + a i -a j + a j -(T k + (T i -a k + icr i x (Tj ■ a k 



(4.171) 



4.8 Mean Field Theory 



We begin with the Heisenberg Hamiltonian 



h3 



7 Here, 'even' and 'odd' refer to the number of 2-cycles into which a given permutation is decomposed. 
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and write 

Si = mi + SSi, (4.173) 
where rrii = (Si) is the thermodynamic average of Si. We therefore have 

Si ■ Sj = rn, ■ rrij + rn, ■ 5Sj + rrij ■ 5Si + 5S{ ■ SSj 

= -rrii ■ mj + m, • Sj + rrij ■ Si + 5 Si ■ 5Sj . (4.174) 

The last term is quadratic in the fluctuations, and as an approximation we ignore it. This 
results in the following mean field Hamiltonian , 

7i MF = +5 ^2 Jij rn, • - ^ (jHi + ^ m^j ■ Si 

i,j i j 

= E - 7 ^2Hf-S l , (4.175) 

i 

where 

Eq =\^2 Jij mi -rrij (4.176) 
Hf = H i + 1 - 1 Y J Jijm j . (4.177) 

3 

Note how the effective field Hf s is a sum of the external field Hi and the internal field 
Hf 11 = 7 _1 J2j Jij 171 j- Self-consistency now requires that 

Tr S-exp(~/Hf ■ SJk B T) 

m i = * , 1 — ^~r~ > 4 - 178 

Tr exp ( 7 Hf • SJk B T) 

where Tr means to sum or integrate over all local degrees of freedom (for site i). The free 
energy is then 

F({rrii}) = \ Jij ™i ■ rrij - k B T £ In Tr exp ( 7 Hf ■ S l /k B T) . (4.179) 

For classical systems, there are several common models: 

• Ising Model with S = ±1: 

rm = t&nh(~/Hf/k B T) 

= tanh(/? 7 Fi + /3^ Jijm^j . (4.180) 

3 

The free energy is 

F = \ ^ J,^ rrii mj - k B T ^ In 2 cosh (p^Hi + (3 ^ J^- m,) . (4.181) 

i,3 i 3 
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• Ising Model with S = -1, 0, +1: 

2 sinhf (3-fHi + (3J2j J ij m j 

m l = ±- '-^ (4.182) 

1 + 2cosh^7fli + (3 Jij mj) 

and 

F = \ Jij m mj - k B T ^ In i 1 + 2 cosh (p^/Hi + (3 J tj m^) \ . (4.183) 

i,j i j 

• XY model with Si = (cos ft, sin ft), H = H x 

/ dft cos ft exp (7iJ? ff cos ft/A; B T) 

rm = (cos ft) = (4.184) 

Jdft exp( 7 tff cos ejk B T) 



o 

iAthHi + pY.jJiM 



(4.185) 



where / n (^) is a modified Bessel function. The free energy is 

F = | mi mj - /c B T In 2vr/ (/37-Hj + (3 ^ ft, m,) . (4. 186) 

• 0(3) model with Si = (sin 6i cos 0j, sin 9i sin <^>j, cos 9i). Suppose that points in the 
direction of Hf s . Then 

2tt / dft sin ft cos ft exp (jHf cosft/A; B T) 
mi = (cos ft) = (4.187) 

2vr Jd0i sin ft exp (7^ cos 0Jk B T) 


= ctnh( 7 tff /k B T) - (4.188) 



7 J 

= ctnh (/J 7 ft 4- /J £ J„ m,) - 7fli+ g Ja „ aj ■ (4.189) 

The free energy is 



f 4vr sinh f (3jHi + (3 £ . J - 

F = i E J « "* " *» r E { ftiws^v > ' (4 ' 190) 



EXERCISE: Show that the self-consistency is equivalent to dF/drrii = 0. 
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4.8.1 Ferromagnets 

Ising Model - Let us assume that the system orders ferromagnetically, with nii = m on all 
sites. Then, defining 

J{q) = Y J J{R)z- iq R , (4-191) 
R 

we have that the free energy per site, / = F/N, is 

/(m) = lj(0)m 2 - £; B TlnTrexp{( 7 J/ + J(0) m) • S/A: b t} . (4.192) 

For the 7L<i (Ising) model, we have 

m = tanh (/3'yH + 0J(O)m) , (4.193) 

a transcendental equation for m. For i7 = 0, we find m = tanh(J(0)m//c B T), which yields 
the Curie temperature T c = J(0)/k B . 

0(3) Model - We have m = mJ^ lies along H. In the H — ► limit, there is no preferred 
direction. The amplitude, however, satisfies 

-jL = m = ctnh (J(0) m/fc B T) - ^ . (4.194) 

um J(0) m 

With x = J(0) m/k B T, then, 

x = ctnhx - - = - - %- + . . . , (4.195) 
J(0) s 3 45 v 7 

hence T c = J(0)/3k B . 

4.8.2 Antiferromagnets 

If the lattice is bipartite, then we have two order parameters: m A and m B . Suppose 
Jjj = — J < if i and j are nearest neighbors, and zero otherwise. The effective fields on 
the A and B sublattices are given by 

Hfv^H-^zJm^, (4.196) 

Note that the internal field on the A sublattice is —j~ 1 zJm B , while the internal field 
on the B sublattice is —^ 1 zJm A . For the spin-5 quantum Heisenberg model, where 
S z e {-S,...,+S}, we have 

sinh(,S+l)£ 

Tr exp(£ • S) = , (4.197) 

hence, with £ = jH^ B /k B T, we have 

(S) = iSB s (S0 (4.198) 
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where B s (x) is the Brillouin function, 

B s (x) = (l + ^) ctnh ((1 + ^) x) - ± ctnh (±) . (4.199) 

In order to best take advantage of the antiferromagnetic interaction and the external mag- 
netic field, the ordered state is characterized by a spin flop in which m A and m B are, for 
weak fields, oriented in opposite directions in a plane perpendicular to H, but each with a 
small component along H. 

When H = 0, the mean field equations take the form 

m A = SB s (zJSm B /k B T) (4.200) 
m B = SB s (zJSm A /k B T) , (4.201) 

where we have assumed m A and m B are antiparallel, with m A = m A h and m B = — m B h, 
where n is a unit vector. From the expansion of the Brillouin function, we obtain the Neel 
temperature T N = z.J/k B . 



4.8.3 Susceptibility 

For T > T c the system is paramagnetic, and there is a linear response to an external field, 

J J * J 

= t^ri^ Sj) ~ ($D (%)} (4-203) 

where {i,j} are site indices and {fi, u} are internal spin indices. The mean field Hamiltonian 
is, up to a constant, 

n MF = -jY1 H f • Si ' ( 4 - 204 ) 

i 

which is a sum of single site terms. Hence, the response within 7i MF must be purely local 
as well as isotropic. That is, for weak effective fields, using Mj = jrrii, 

M. = X Hf = X H i + r 2 X J tj M j , (4.205) 

which is equivalent to 

% ~ r 2 X J tj ) M- =X H i , (4.206) 
and the mean field susceptibility is 

*%= [^-r 2 J]~-^. (4.207) 

It is convenient to work in Fourier space, in which case the matrix structure is avoided and 
one has 

m = - — tStt • ( 4 - 208 ) 

1 - 7" 2 X J(q) 
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The local susceptibility X is readily determined: 

7 V 7 7 Tr exp( 7 if • S/k B T) 

= 1 SB s (S 1 H/k B T)H» , (4.209) 

where B s (x) is the Brillouin function from eqn. 4.199. As H — > we have M = X H, 
with 

^ = ^"^1 5X » r ' (4 - 210) 

where X = Tr (S 2 )/ Tr 1, where iV is the number of components of 5 M . Thus, for the Ising 
model (N = 1) we have XQ Smg = j 2 /k B T, while for the spin-S 1 quantum Heisenberg model 
we have Xo eis = S(S + l)7 2 /3fc B T. Note that X oc T -1 ; the splitting of the degenerate 
energy levels by the magnetic field is of little consequence at high temperatures. 

In many cases one deals with 'single ion anisotropy' terms. For example, one can add to 
the Heisenberg Hamiltonian a term such as 

v2 



H^D^iS?) 2 , (4.211) 



which for D < results in an easy axis anisotropy (i.e. the spins prefer to align along the 
z-axis), and for D > results in an easy plane anisotropy (i.e. the spins prefer to lie in 
the (x, y) plane). Since this term is already the sum of single site Hamiltonians, there is no 
need to subject it to a mean field treatment. One then obtains the mean field Hamiltonian 

h mf = ^E(^) 2 -tE H f ■ s * ■ ( 4 - 212 ) 

i i 

In this case, X is now anisotropic in spin space. The general formula for X is 

A V = j^f (SW) (4.213) 

where the thermodynamic average is taken with respect to the single site Hamiltonian. 8 
One then has 

= [I - 7 _2 i(9) X C . ( 4 - 214 ) 
where the matrix inverse is now in internal spin space. 



4.8.4 Variational Probability Distribution 

Here's another way to derive mean field theory. Let Q represent a configuration and let Pq 
be a probability distribution, normalized such that J2n^n = 1- We- define the entropy of 
the distribution as 



S[P] = -k B J2Pa^P n . (4.215) 



j Note that in (4.210) the single site Hamiltonian is simply Ho = 0. 
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We now ask what distribution Pq minimizes the free energy F = (TC) — TS. Working in an 
eigenbasis of 7i, we have 

F = £ p n E n + k B T £ P n In P n . (4.216) 

n n 

We extremize F subject to the normalization constraint, which is implemented with a 
Lagrange multiplier A. This means we form the extended function 

F*{{Pnh*) =5> f ^n + fc B Tj> a lnP f ,-A(j> n -l) , (4.217) 

n n n 

and demand dF*/dP n = for all as well as dF*/d\ = 0. This results in the Boltzmann 
distribution, 

pcq = I e -EJk B T ; z = e ~Ei/k B T _ (4 218) 

I 

Thus, any distribution other than results in a larger free energy. 

Mean field theory may be formulated in terms of a variational probability distribution. 
Thus, rather than working with the Boltzmann distribution P^ q , which is usually in- 
tractable, we invoke a trial distribution Pq(x 1 , x 2 , ■ ■ •), parameterized by {x ± , x 2 , ■ ■ •}, and 
minimize the resultant F = (H) — TS with respect to those parameters. 

As an example, consider the Ising model with spins &i = ±1. Each configuration is given 
by the set of spin polarizations: VL = {c^, . . . ,a N }. The full equilibrium probability distri- 
bution, 

= Z- 1 exp (j3J a i a j) ' (4.219) 

(ij) 

with P = l/k B T, is too cumbersome to work with. We replace this with a variational 
single-site distribution, 

N 

Pn = I[Pi(<7i) (4-220) 

Pi(o-i) = \{l + rrn) S aii+1 + \{l - rm) <5 CTi) _! . (4.221) 

The variational parameters are {m 1 , . . . ,m N }. Note that P Q is properly normalized, by 
construction. 

The entropy of our trial distribution is decomposed into a sum over single site terms: 

S[P] = J2 s ( m i) (4.222) 




(4.223) 
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The thermodynamic average (<jj) is simply 

(ai) = TrP i (a i )a i = m i , (4.224) 

hence from 

we derive the free energy 

, m I 1 + mi , / 1 + mj \ 1 — mi , / 1 — m; \ 1 

+ *. r £{— + — '"(—)} 

Varying with respect to each m^ we obtain the coupled nonlinear mean field equations, 

mi = tanh ^ J^- rn, + 7^) /k B T . (4.227) 



For uniform magnetization (mj = mVi), the free energy per site is 

F 1 ;/ n \ 2 tt ? ^J 1 + m i /l + m\ 1-m /l-m\l 
- = -I J(0) m 2 - 7 tf m + fc B T j — In (— ) + — In ( — ) j 

= 5(fe B T- J(0))m 2 - 7 Fm+^/c B Tm 4 + ^/c B Tm 6 + ... (4.228) 



To compute the correlations, we may use the expression 

2 

Xij(T) = ^{(nvj) ~ fa> fo>} (4-229) 



_ dMi _ dmi _ d 2 F 

~ dH~ ~ 1 dH~~~dH i dHj ' ( ' ' 

Thus, there are two ways to compute the susceptibility. One is to evaluate the spin-spin 
correlation function, as in (4.229). The other is to differentiate the magnetization to obtain 
the response function, as in (4.230). The equality between the two - called the "fluctuation- 
dissipation theorem" - is in fact only valid for the equilibrium Boltzmann distribution P^ q . 
Which side of the equation should we use in our variational mean field theory? It is more 
accurate to use the response function. To roughly see this, let us write P = P eq + 8P, with 
SP small in some sense. The free energy is given by 

F[P] =F[P eci }+5P' 6 ' 



5P 



+ 0{{SP) 2 ) . (4.231) 

P=peq 



Our variational treatment guarantees that the second term vanishes, since we extremize F 
with respect to P. Thus, in some sense, the error in F is only of order (SP) 2 . If we compute 
the correlation function using (A) = Tr (PA), where A is any operator, then the error will 
be linear in SP. So it is better to use the response function than the correlation function. 



EXERCISE: Articulate the correspondence between this variational version of mean field 
theory and the 'neglect of fluctuations' approach we derived earlier. 
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4.9 Magnetic Ordering 

The q-dependent susceptibility in (4.208) diverges when r y~ 2 X J(q) = 1. As we know, 
such a divergence heralds the onset of a phase transition where there is a spontaneous 
magnetization in the ordered (i.e. low temperature) phase. Typically this happens at a 
particular wavevector Q, or a set of symmetry related wavevectors {Qi,Q2, ■ ■ ■}■ The 
ordering wavevector is that value of q which results in a maximum of J(q): max g { J(q)} = 
J(Q). The susceptibility, for isotropic systems, can be written 

X(q) = ; — ; ; • (4-232) 

[1 - 7-% J(Q)]+ 7~% [</(<?)-</(<?)] 

The critical temperature T c is determined by the relation 

7 - 2 X (T c ) J(Q) = 1. (4.233) 
Expanding about T = T c , and about q = Q, where 

J(g) = J(Q){l - (g - QfRl + ...}, (4.234) 

we have 

m * ^trw • (4 ' 235) 



where 



£-\T) = -^-^ ■ R- 2 ■ (T - T c ) . (4.236) 

Thus, £(T) oc (T — T c ) -1 / 2 . The real space susceptibility X(Ri-Rj) oscillates with wavevec- 
tor Q and decays on the scale of the correlation length £(T). 



Ferromagnet: J^j = +J > if i and j are nearest neighbors; otherwise Jij = 0. On a 
hypercubic lattice (d dimensions, 2d nearest neighbors), we then have 

J(q) = J^ e - iqS = 2 j{cos(g 1 a) + cos(g 2 a) + . . . + cos(q d a)} . (4.237) 

The ordering wavevector is Q = 0, and J(Q) = 2dJ. For the spin-S" Heisenberg 
model, then, T c = ^dS(S + 1) J/k B , and the susceptibility is 

h 2 s(s + l)/k B 

x(q) = 2j — ^ — 7 • 4 - 238 

(r-r c ) + r c d-iE^i[i-«»M] 

The uniform susceptibility X = x(g = 0) is then 

X{T) = 7 ^ (S + 1) . (4.239) 
3k B (T-T c ) 

Ferromagnetic insulators: ferrites, EuO, TDAE-Ceo- 
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• Antiferromagnet: Jij = —J < if i and j are nearest neighbors; otherwise Jjj = 0. 
On a hypercubic lattice (d dimensions, 2d nearest neighbors), we then have 

J(q) = -J^2e~ iqS = -2j{cos(g 1 a) + cos(g 2 a) + . . . + cos(q d a)} . (4.240) 
s 



The ordering wavevector is Q = (n/a, . . . , 7r/a), at the zone corner, where J(Q) 

r - ? 

■N — 3 



2d7. For the spin-5 Heisenberg model, then, T N = ^dS(S + 1) J/fe B , and the suscep- 



tibility is 

7 2 £(S + l)/3fc : 
(T - T N ) +T N d-i^ = i [1 + cos(^a)] 
The uniform susceptibility X = x(Q = 0) is then 



X(Q) = ~ — 1 ! Tf — T • (4-241) 



= 7 2 S(S+1) 
3k B (T + T N ) 

which does not diverge. Indeed, plotting X _1 (T) versus T, one obtains an intercept 
along the T-axis at T = — T N . This is one crude way of estimating the Neel tem- 
perature. What does diverge is the staggered susceptibility X s tag = x{Q,T), i.e. the 
susceptibility at the ordering wavevector: 

• Frustrated Antiferromagnet: On the triangular lattice, the antiferromagnetic state 
is frustrated. What does mean field theory predict? We begin by writing primitive 
direct lattice vectors {a a ,a 2 } and primitive reciprocal lattice vectors {b 1 ,b 2 }: 

a, = a (1,0) fo 1 = ^(f,-I) (4.244) 

a 2 = a{l4) 6 2 =i?=(0,l), (4.245) 

where a is the lattice constant. The six nearest neighbor vectors are then 

5 € ^a^a 2 ,a 2 - a l ,-a l ,-a 2 ,a l -a 2 | , (4.246) 

and writing q = x ± bi + X2b 2 , we find 

J(q) = -2J{cos(2vtxi) + cos(2vrx 2 ) + cos(2vrxi - 2vrx 2 )} • (4.247) 

We suspect that this should be maximized somewhere along the perimeter of the 
Brillouin zone. The face center lies at (x±,X2) = (3, 5), where J(q) = +2J. However, 
an even greater value is obtained either of the two inequivalent zone corners, (x±, x 2 ) = 
(§, |) and (xi,x 2 ) = (|, |), where J(q) = +3J. Each of these corresponds to a 
tripartite division of the triangular lattice in to three y/3 x y/3 triangular sublattices. 

Antiferromagnetic insulators: MnO, CoO, FeO, NiO, La 2 Cu04. 
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• Helimagnet: Consider a cubic lattice system with mixed ferromagnetic and antiferro- 
magnetic interactions: 



Jij 



+J\ > 6 nearest neighbors 

— J 2 < 12 next-nearest neighbors (4.248) 
otherwise . 



Then 

J(q) = 2Ji [cos(q x a) + cos(q y a) + cos(q z a)] (4.249) 
— 4J 2 [cos(q x a) cos(q y a) + cos(q x a) cos(q z a) + cos(q y a) cos(q z a)] . 

The ordering wavevector is then 



'a- 1 cos" 1 (x + y + z) if J x < 4 J 2 



[0 ifJi>4J 2 . 

Thus, for J\ < 4 J2 the order is incommensurate with the lattice. The maximum value 
of J(q) is 

( 3J 2 

J(Q) = \ 4^ lf Jl < 4J2 (4.251) 

(6(Ji-2J 2 ) ifJi>4J 2 , 

hence incommensurate order sets in at Tj = S(S + 1) Jf /4k B J 2 . The uniform suscep- 
tibility is 

*(0) = ^(5+1)/^ . (4 . 252) 



r _ 8Ti |(l_2^) ' 



Thus, 

c 



X(T)J^ ° < Jl < 2 J2(Uke AFM) (4-253) 
[ySj^ 2J 2 < J x < 4J 2 (like FM) . 



4.9.1 Mean Field Theory of Anisotropic Magnetic Systems 

Consider the anisotropic Heisenberg model, 

intra inter 

H= -J^jlSi-Sj -^jSi-Sj-^Hi-Si. (4.254) 

i<j i<j i 

Here, jf- only connects sites within the same plane (quasi-2d) or chain (quasi-ld), while 
J^j only connects sites in different planes /chains. We assume that we have an adequate 
theory for isolated plains/ chains, and we effect a mean field decomposition on the inter- 
plane/interchain term: 

(fluct) 2 

S t • Sj = -(Si) • (Sj) + (Si) • Sj + (Sj) ■ S t + 6S^6S^ , (4.255) 
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resulting in the effective field 

H cS (q,u;) = H(q,u) + l~ 2 J L {q ± ) M{q,u) , (4.256) 
where M(q,u) = j{S(q,uj)}. Thus, 

x"(g,|,w) 

X(Q, w) = - , (4-257) 

l- 7 -2j^(q ± )X ll (g || ,a,) 

where x"(g,,,a;) is assumed known. 



4.9.2 Quasi-ID Chains 

Consider a ferromagnet on a cubic lattice where the exchange interaction along the z- 
direction (||) is much larger than that in the (x,y) plane (_L). Treating the in-plane inter- 
actions via mean field theory, we have 

1-7 2 J ± (q ± )x 1D ( ( lz) 

with 

J ± (q±) = 2J J _{cos(<7 x a) + cos(g y a)| . (4.259) 
For the Ising model we can compute Xid(^) exactly using the high temperature expansion: 

Tr {o n a n , n 7 (1 + tanh( JJk B T) Oj a j+1 ) } 

{<j n a n ,) = ^ } - (4.260) 

Trn^l + tanMJu/^r)^^!) 

= tanhl n - n 'l(Jn/A; B r) . (4.261) 



Thus, 



2 00 

X 1D (q z ) = ^= £ tanhl"l(J || A B T)e^ c 



k B T 

D n=~oo 

7 2 



K T cosh(2J||/fc B T) -siiJi(2J||/fc B r) cos(g 2 c) 
2tt 7 2 1 f" 1 



(4.262) 
(4.263) 



where c is the lattice spacing along the chains, and where the last approximation is valid 
for q — ► and £ — > oo. The correlation length in this limit is given by 

C(T) ~ | exp(2J || /fc B T) . (4.264) 
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Note that £(T) diverges only at T = 0. This is consistent with the well-known fact that 
the lower critical dimension for systems with discrete global symmetries and short-ranged 
interactions is d = 1. That is to say that there is no spontaneous breaking of any discrete 
symmetry in one-dimension (with the proviso of sufficiently short-ranged interactions). For 
continuous symmetries the lower critical dimension is d = 2, which is the content of the 
Hohenberg-Mermin- Wagner (HMW) theorem. 

Accounting for the residual interchain interactions via mean field theory, we obtain the 
anisotropic (in space) susceptibility 

X(q ± ,q z ) = ; ; • (4-265) 

1 - 7 2 • 2J ± {cos(q x a) + cos(g w a)} • x 1D (<? 2 ) 

Three-dimensional ordering at Q = sets in at T = T c , which occurs when x(Q) has a 
pole. The equation for this pole is 

4J 

4 7 " 2 J ± X 1D = 1 => =exp(-2J || /fc B T c ) . (4.266) 



k B T c 



This transcendental equation is equivalent to 



xe x = X - (4.267) 
where x = 2J^/k B T c and e = 2J ± /J^. The solution, for small e, is 

2J,, 

fc B T c = ^ + . . . . (4.268) 

ln(j ||/2 J ± ) 

Thus, T c > for all finite Jj_, with T c going to zero rather slowly as J ± — ► 0. 

Similar physics is present in the antiferromagnetic insulator phase of the cuprate supercon- 
ductors. The antiferromagnetic (staggered) susceptibility of the two-dimensional Heisenberg 
model diverges as T — > as X^ g ~ exp(pJk B T), where p is a dimensionless measure 
of quantum fluctuations. As in the d = 1 Ising case, there is no phase transition at any 
finite temperature, in this case owing to the HMW theorem. However, when the quasi-2D 
layers are weakly coupled with antiferromagnetic coupling J' (the base structure is a cubic 
perovskite), three-dimensional Neel ordering sets in at the antiferromagnetic wavevector 
Q = (n/a, 7r/a, w/c) at a critical temperature T N rs J/k B ln(J/J'). 



4.10 Spin Wave Theory 

Recall the SU(2) algebra of quantum spin: [S a , S@] = it^S 1 (set h = 1 for convenience). 
Defining 5 ± = S x ± iS' y , we have, equivalently, 

[S 2 , 5 ± ] = ± , 5 _ ] = 2 5 2 . (4.269) 
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The Holstein-Primakoff transformation (1940) maps the spin algebra onto that of a single 
bosonic oscillator: 

S + = (J (2S - a)a) l/2 (4.270) 
S- = (2S-a)a) 1 / 2 a (4.271) 
S z = a f a - S . (4.272) 

The state | S z = —S ) is the vacuum | ) in the boson picture. The highest weight state, 
| S z = +S ) corresponds to the state | 25) in the boson picture, i.e. an occupancy of n = 2S 
bosons. 

EXERCISE: Verify that the bosonic representation of the spin operators in (4.272) satisfies 
the SU(2) commutation relations of quantum spin. 

What does it mean to take the square root of an operator like 2S* — a)al Simple! Just 
evaluate it in a basis diagonal in a' a, i.e. the number basis: 

a)a\n)=n\n) => (2S - a)a) 1/2 \ n ) = (2S - n) 1/2 | n ) . (4.273) 

Note that physical boson states are restricted to n € {0, 1, ... , 2S}. What about states with 
n > 2S1 The nice thing here is that we needn't worry about them at all, because S + , S~ , 
and S z do not connect states with < n < 2S to states with n > 2S. For example, when 
applying the spin raising operator S + to the highest weight state | S z = +S), in boson 
language we have 

S + \S Z = +S) = <J {2S - a f a) 1/2 | n = 2S > = , (4.274) 

as required. 

While the HP transformation is exact, it really doesn't buy us anything unless we start 
making some approximations and derive a systematic expansion in 'spin wave' interactions. 



4.10.1 Ferromagnetic Spin Waves 

Consider the classical ground state |F) = |J,j---j)in which all spins are pointing 'down', 
with S z = —S. In the boson language, the occupancy at each site is zero. This is in fact 
an eigenstate of the Heisenberg Hamiltonian 

U = - J ij s i • s j (4.275) 

i<j 

with eigenvalue Eq = —S 2 J2i<j Jij- If au * ne interactions are ferromagnetic, i.e. Jij > 
V (i,j), then this state clearly is the ground state. We now express the Heisenberg 
interaction Si ■ Sj in terms of the boson creation and annihilation operators. To this end, 
we perform a Taylor expansion of the radical, 

< 2S -^^H@H(^) 2 + ...}, (-a, 
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so that 



S, .Sj = l S+Sr + \ S7S+ + 5f 5J (4.277) 
= 5fl t(l-^ + ...)(l-^ + ...)a, (4.278) 



+ 5^1-^ + ...ja i a]|l-^ + ...j+(a^-5)(a].a j -S) 
= S 2 + S[a\ aj + aJ-Oi - a\ ai - a]a^j + {at^at^ - \ a\a\a iaj 

~ \ a H a 3 a 3 ~ i M a i a i - i a 5 a k-°i} + °(V5) • (4-279) 

Note that a systematic expansion in powers of 1/5 can be performed. The Heisenberg 
Hamiltonian now becomes 



classical ground 

state energy 0(S 2 ) spin-wave Hamiltonian H sw spin-wave 

inter (\ (. 1 1 oiis 



H= -S 2 Y,Jii +S^J i j{a\a i + a)a j -a\a j -a)a % )+ O(S ) . (4.280) 

i<j i<j 

We assume our sites are elements of a Bravais lattice, and we Fourier transform: 

«i = "4 E e+tq ' Rl % 4 = -Tff E 4 (4-281) 



= E e ~^ fi! 4 = E e+ ^* 1 4 • ( 4 - 282 ) 



Note that the canonical commutation relations are preserved by this transformation: 

Ka]] = S ij Ka\,\ = 5 qq) . (4.283) 

Using the result 

^E e%_9 ' ) Hi = V > ( 4 -284) 

i 

we obtain the spin-wave Hamiltonian 

n sw = S [AO) - J(q)] a\a q , (4.285) 

from which we read off the spin-wave dispersion 

Hu q = S[J(0)-J(q)] (4.286) 

= ls[^2j(R)R 2 ]q 2 + 0(g 4 ) . (4.287) 
R 

The above sum on R converges if J(R — ► oo) ~ ^~( rf + 2 + e ) with e > 0. 
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4.10.2 Static Correlations in the Ferromagnet 

The transverse spin-spin correlation function is 

(St SJ) = (a\ (2S - a\ ai ) 1 ' 2 (2S - a) aj ) l/2 a,-) (4.288) 
= 2S (at 0j ) + O(S ) (4.289) 

/jdu p ik-{Rj-Ri) 
4^7, • (4-290) 

n 

The longitudinal spin-spin correlation function is 

(SfS*) - (St) <S|) = (4a t a] aj ) - (a\a % ) (a)^) = O(S ) . (4.291) 

Note that the average spin polarization per site is 

(S?) = -S+(a\aJ (4.292) 

= -S + n[ 1 f^ 1 . (4.293) 

n 

Now as k — > the denominator above vanishes as k 2 , hence the average spin polarization 
per site diverges when d < 2. This establishes a "poor man's version" of the HMW theorem: 
as infinite spin polarization is clearly absurd, there must have been something wrong with 
our implicit assumption that long-ranged order persists to finite T. In d = 3 dimensions, 
one finds (Sf) = -S + 0(T 3 / 2 ). 



4.10.3 Antiferromagnetic Spin Waves 

The case of the ferromagnet is special because the classical ground state | F ) is in fact a 
quantum eigenstate - indeed the ground state - of the ferromagnetic Heisenberg Hamil- 
tonian. 9 In the case of the Heisenberg antiferromagnet, this is no longer the case. The 
ground state itself is a linear combination of classical states. What is the classical ground 
state? For an antiferromagnet on a bipartite lattice, 10 the classical ground state has each 
sublattice maximally polarized, with the magnetization on the two sublattices oppositely 
oriented. Choosing the axis of polarization as z, this means Sf = — S is i £ A and Sf = +S 
if i G B. We'll call this state | N), since it is a classical Neel state. 

Let is assume that the lattice is a Bravais lattice with a two-element basis described by 
basis vectors and d. Thus, if R is any direct lattice vector, an A sublattice site lies at R 

9 Of course, |F) is also an eigenstate - the highest lying excited state - of the antiferromagnetic Heisenberg 
Hamiltonian. 

10 A bipartite lattice is one which may be decomposed into two sublattices A and B, such that all the 
neighbors of any site in A lie in B, and all the neighbors of any site in B lie in A. Examples of bipartite 
lattices: square, honeycomb, simple cubic, body-centered cubic, hexagonal. Examples of lattices which are 
not bipartite: triangular, Kagome, face-centered cubic. 
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and a B site at R + S. The Heisenberg Hamiltonian is written 

H = ~ E { \ J aa(^ " R ') S a(R) ■ Sa(R') + \ J BB (R - R') S B (R) ■ S B (R') (4.294) 

R,R' l ~ 

+ J AB (R- R! - 5) S A (R) ■ S B (R')\ . 

Here S A {R) represents the spin on the A sublattice located at position R, while S B (R) 
represents the B sublattice spin located at R + d. The factor of \ multiplying the J AA and 
J BB terms avoids double-counting the AA and BB interactions. The Neel state will be the 
classical ground state if J AA > and J BB > and J AB < 0. It may remain the ground 
state even if some of the interactions are frustrating, i.e. J AA < 0, J BB < 0, and/or J AB > 
between certain sites. 

We'd like the Neel state | N) = | TltlT • • • ) to be the vacuum for the Holstein-Primakoff 
bosons. To accomplish this, we rotate the spin operators on the B sublattice by tt about 
the i)-axis in the internal SU(2) space, sending S x — > — S x , S y — ► S y , and S z — > — S z . In 
the language of HP bosons, we have the following: 

A Sublattice B Sublattice 

S + = a){2S -cJa) 1 / 2 S + = -(25 - tfb) 1 ^ (4.295) 

S- = (2S - a)a) 1/2 a S~ = -6 t (25 - tfb) 1/2 (4.296) 

S z = a ^a-S S z = S- tfb (4.297) 

We may now write the Heisenberg interaction as an expansion in powers of 1 /S: 

S A {R) ■ S A (R') = S 2 + S (<4 a Rf + a} RI a R - a R a R - a R , a R ) + 0(S°) (4.298) 
S B (R) ■ S B (R') = S 2 + S (b R b RI + tf RI b R -b R b R - b RI b R ) + O(S ) (4.299) 
S A (R) ■ S B (R') = -S 2 + S (a R a R + b R b R - a R b RI - a R b R/ ^j + O(S ) . (4.300) 

Thus, the classical ground state energy is the 0(S 2 ) term, 

E * = 52 E { " \ j aa(^ -R r )~\ Jbb(R - R') + Jab(R -R'-S)} . (4.301) 

R,R' 

The spin- wave Hamiltonian is the 0(S 1 ) piece, 

H sw = SJ2{ J aa(R - R') (4 a RI - a R a R ) + J BB (R - R') (b R b RI - b R b R ) 
+ Jab(R -R'-S) (a R a R + b R b R - a R b RI - a R b R ,^j | . 
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We now Fourier transform: 

n - 1 ST „+ik-R 

(in — — z= > e ' 



t 


1 


R — 




L- 


1 




\/iV 



(4.302) 
(4.303) 



which leads to 



fc,fc' R,R' 

= ^J AB (fc)4a fe (4.304) 

= ^J AB (fc)46L fc , (4.305) 
fe 

where, assuming J AA , J BB and J AB are functions only of the magnitude of their arguments, 

J AA (k)^^J AA (\R\)e^ R (4-306) 
R 

j BB (k)^Y. J ™(\ R \) etk ' R 

R 

J AB (k)^^2j AB (\R + 8\)e^ R ^ . 
R 

Note that J AA (k) = J AA (-k) = [J AA (k)]* (similarly for J BB ), and J AB (fc) = [J AB (-fc)]*. 
The spin-wave Hamiltonian may now be written as 

n sw = SY, { (4,(0) - J AA (k) - J AB (0)) 4a fe + (J BB (0) - J BB (k) - J AB (0)) b{b k 
k I 

+ J AB (fc)4&L fc + J^(fc)a fc &_ fc J . (4.309) 

In other words, 



(4.307) 
(4.308) 



H sw = £ {nj^ 4 a fc + 4 6 fc + A fc 4 bl k + A* a fc 



with 



n A k A = s[J AA (o)- J AA (fc) - i AB (o)) 
fifc B = 5(j BB (o)- i BB (fe)- j ab (o)) 



(4.310) 

(4.311) 
(4.312) 
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and 

A k = SJ AB (k) . (4.313) 
Henceforth we shall assume J AA (R) = J BB (R), so fi£ A = Q k B = fi fc . 

Note that the vacuum | ) for the a and b bosons is not an eigenstate of H sw , owing to the 
spin- wave pair creation term A* k a k b_ k . This can be traced back to the effect on the Neel 
state of the Heisenberg interaction, 

S t -Sj = l St S- + IS-S+ + S? SJ . (4.314) 

If i G A and j G B, then the term S^ S~ acts on the configuration | — S , +S ) and converts 
it to 2S | — S + 1 , S — 1) . Nevertheless, we can diagonalize 7i sw by means of a canonical 
(but not unitary!) transformation, known as the Bogoliubov transformation. Note that for 
each k G Cl, the spin-wave Hamiltonian couples only four operators: a k , a k , b^_ k , and b_ k . 
We write the Bogoliubov transformation as 

a k = u k a k- v K 3 -k h -k = u kP-k- v k a k ( 4 - 315 ) 

4 = 4 4 - v k P- k b -k = 4 0-k - v k a k ( 4 - 316 ) 

One can readily verify that this transformation preserves the canonical bosonic commutation 
relations, 

K 4'] = K 4] = k> 4] = 4'] = <W (4-317) 

provided that 

4 u k ~ v k v k = 1 • ( 4 - 318 ) 

The inverse transformation is 

«* = 4 a k + 4 6 4 ^- fc = 4 + 4 4 ( 4 - 319 ) 

4 = % 4 + « fc = u k b -k +v k a k . (4.320) 

We'll write 

u k = exp(ir] k ) cosh(# fe ) , v k = exp(-ir] k ) sinh(0 fc ) . (4.321) 
We may then write 

a k = exp(ir/ fc ) cosh(# fc ) a k - exp(ir} k ) sinh(0 fe ) $_ k (4.322) 

b_ k = exp(i Vk ) cosh(# fc ) / k - exp(i7] k ) sinh(0 fe ) a[ (4.323) 

as well as the inverse 

a k = exp(-i Vk ) cosh(6y a k + exp(ir ?fc ) sinh(# fc ) $_ k (4.324) 
= exp(-ir/ fc ) cosh(0 fe ) (3_ k + exp(ir ?fc ) sinh(# fc ) a\ . (4.325) 
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Substituting into the expressions from H sw , we find 

V k (4 H + b l K) = cosh(2# fc ) (4 a k + (3l k I , + 1) - n k 

~ V k sinh(2^) (4 Pl k + a k (3_ k ) (4.326) 
A k 4 «U + A l H b -k = - 1 A k I sinh(2£ fe ) (4 « fc + /3l fc + l) 

+ |A fc | cosh(20 fc ) {alpl k + a k P_ k ) , (4.327) 

where we have taken r] k = ^arg(A fe ). Up until now, 9 k has been arbitrary. We now use 

this freedom to specify 6 k such that the (4<^-fc + a kfi-k) terms vanish from H sw . This 
requires 

|A fc | cosh(2# fc ) - n k sinh(20 fc ) = =j- tanh(20 fc ) = ^ , (4.328) 
which means 

cosh(2# fe ) = ^| , S inh(2^) = ^l (4.329) 
along with the dispersion relation 



E k = ^l-K\ 2 ■ (4-330) 
Finally, we may write the diagonalized spin-wave Hamiltonian as 

H sw = ^E k (4 a k + Pi k ) +J2( E k~ «*) • ( 4 -331) 



Note that E k = since J AB (k) = J AB (—k). The two terms above represent, respectively, 
the spin-wave excitation Hamiltonian, and the 0{S 1 ) quantum correction to the ground 
state energy. Since E k < this correction is always negative. 

As k — ► 0, we have, assuming cubic or higher symmetry, 

n k = S £ J AB (\R + S\) + IS fc 2 J aa (I HI) R 2 + ■ • • (4-332) 

R R 

= SW + SXk 2 + ... 

A k = +sY,J A n(\R + ^\)-lSk^^(\R + ^\)\R + s\ 2 + ... 

R R 

= -SW + SYk 2 + ... . (4.333) 



The energy dispersion is linear: E k = hc\k\, where c = Sy/2W(X + Y). Antiferromag- 
netic spin waves are Goldstone bosons corresponding to the broken continuous symmetry 
of global spin rotation. The dispersion vanishes linearly as k — > 0, in contrast to the case 
of ferromagnetic spin waves, where E k vanishes quadratically. 
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Reduction in Sublattice Magnetization 



Let's compute the average of S z for a spin on the A sublattice: 



(S z (R)) = -S+(a R a R ) 



k 




BZ 



(27r)*l£ fc exp(£7 fc /fc B T)-l 2 \E k 



u l a k ~ v k P-k) K a k ~ v *k P-k)) 



n k i i fn k 




(4.334) 



where v is the Wigner-Seitz cell volume, and the integral is over the first Brillouin zone. 
The deviation 5S Z = (a) a) from the classical value (S z ) = —S is due to thermal and 
quantum fluctuations. Note that even at T = 0, when the thermal fluctuations vanish, 
there is still a reduction in sublattice magnetization due to quantum fluctuations. The Neel 
state satisfies the SfSj part of the Heisenberg interaction, but the full interaction prefers 
neighboring spins to be arranged in singlets, which involves fluctuations about local Neel 
order. 

We've seen that Q k ~ SW and E k ~ hc\k\ as k — > 0. Thus, the integrand behaves as 
T/k 2 for the first term and as l/\k\ for the second term. The integral therefore diverges in 
d < 2 at finite T and in d = 1 even at T = 0. Thermal and quantum fluctuations melt the 
classical ordered state. 



4.10.4 Specific Heat due to Spin Waves 

The long wavelength dispersion u q = Aq 2 has thermodynamic consequences. Consider a 
general case of a bosonic dispersion tu q = A\q\ a . The internal energy for a system in d space 
dimensions is then 




(4.335) 



o 



where n d = 2n d / 2 /T(d/2) is the area of the unit sphere in d dimensions. Thus, E(T) 
T 1+ ^, leading to a low-temperature heat capacity of 



oc 




(4.336) 
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At high T, one must impose a cutoff at the edge of the Brillouin zone, where k ~ vr/a, in 
order not to overcount the modes. One finds 

E(T) = k B Tvj = Nk B T , (4.337) 
n 

where N is the number of unit cells. This simply is the Dulong-Petit result of k B T per 
mode. 

For ferromagnetic spin waves, we found a = 2, hence C y oc T d l 2 at low temperatures. As 
we shall see, for antiferromagnetic spin waves, one has a = 1, as in the case of acoustic 
phonons, hence C y oc T d . 

Suppose we write the long- wavelength ferromagnetic spin- wave dispersion as Hto q = CJ(qa) 2 , 
where a is the lattice spacing, J is the nearest neighbor exchange, and C is a dimensionless 
constant. The ferromagnetic low-temperature specific heat is then 

^=r ( 2 + i (i )C(l + I (i )^(^) d/2 , (4.338) 

hence C£ oc (T/Qj) d / 2 , with @j = CJ/k B . Acoustic ph onons with a u k = hc\k\ dispersion 
lead to a Debye heat capacity 

CV° = r(2 + cOC(l + cO|^(££f, (4.339) 

hence C D oc (T/0 D ) d , with D = hc/ak B . Thus, at the lowest temperatures, the specific 
heat due to spin waves dominates, but at intermediate temperatures it is the phonon specific 
heat which dominates. The temperature scale T* at which the two contributions are roughly 
equal is given by 

(T*/e j ) d/2 ~ {T*/e D ) d => T*~@l/@j. (4.340) 



